SYNOPSIS


DISCONTINUITIES OF OPTIMAL CONTROL 
3. Das 

Department of Mathematics, 

Indian Institute of Technology, Kanpur. 

We consider the time optimal control problem for
non-linear systems of the form

$$\frac{dx}{dt} = f(x,u), \tag{1}$$

where $x$ belongs to an $n$-dimensional smooth manifold $M$, $f$ is
well defined for our purpose and $u$ is a scalar control with
$a \le u \le b$ for some real $a$ and $b$. Our aim is to study the
discontinuities of the optimal control. More precisely, we
seek answers to the following questions. What are the primary
factors that lead to the appearance of discontinuities in
the optimal control, and which conditions on the system put
an upper bound on the number of these discontinuities,
regardless of the initial and the target points?

We initiate the study with an investigation to determine
whether the dimensionality of the state space influences the
number of discontinuities of the optimal control. We show
that in systems of the form (1), where it is possible to have
the optimal trajectories as concatenations of the integral
curves of a countable family of $C^1$ vector fields defined on
$M$ and where, given $x \in M$, the set of points optimally
reachable from $x$ possesses a non-empty interior in $M$, there
exists an explicit dependence of the number of discontinuities
of the optimal control on the dimensionality of the state
space. The dependence is this: an upper
bound on the number of discontinuities, if it exists, cannot
be less than $(n-1)$. The analysis also enables us
to explain the dependence of Fel'dbaum's switching
theorem, for linear systems, on the dimension of the state
space.

In the second stage we make some geometric constructions
with the aim of obtaining a proper representation of the
optimal control. For this we utilise Pontryagin's
maximum principle. This requires the definition of a co-state
vector $\lambda$ and the Hamiltonian

$$H = \langle \lambda, f(x,u) \rangle.$$

The state and co-state variables then satisfy the Hamiltonian
equations

$$\dot{x} = \frac{\partial H}{\partial \lambda}, \qquad \dot{\lambda} = -\frac{\partial H}{\partial x}.$$

The integral curves of the above system of equations lie
in the cotangent bundle $T^*M$. Consider all those integral
curves along which Pontryagin's necessary conditions
are satisfied. An optimal trajectory in $M$ is the projection
of one such integral curve in $T^*M$; we call this integral
curve an optimal path. The optimal paths lie in a subset
$Z \subset T^*M$ and we take these as the basis of our investigation.

It then becomes necessary to represent the optimal control
as a function $(x,\lambda) \to u(x,\lambda)$ on $Z$. This leads us to a
partition of $Z$ by the partition sets $\{C_i\}$. The sets $C_i$, called the
control regions, are determined by the fact that at every
point in $C_i$ the optimal control is represented by a continuously
differentiable function $\mu_i(x,\lambda)$ defined everywhere on $C_i$.
The functions $\mu_i$ collectively express the optimal control
in the form $(x,\lambda) \to u(x,\lambda)$. In the process we also find
out the factors which lead to the appearance of discontinuities
and non-differentiabilities in the optimal control. We note
that this analysis assumes the non-singularity of the system
with respect to the maximum principle.

The optimal control suffers a discontinuity or a
non-differentiability only when the optimal path crosses over
from one control region into another. In the third stage of
our analysis we consider the problem of putting an upper bound
on the number of such crossings which give rise to
non-smoothness in the optimal control. To this end we derive
a geometric condition called the transversality condition
and show that when a system satisfies the transversality
condition the total number of discontinuities and
non-differentiabilities suffered by the optimal control along an
optimal path does not exceed the number $(n-1)$. Finally, we
prove that for linear time optimal problems, the transversality
condition follows from the assumptions made in Fel'dbaum's
switching theorem.



CHAPTER 0 


INTRODUCTION 
0-1 Preliminary Remarks 

Given a mathematical description of some physical
process to be controlled, the objective of optimal control
theory is to determine the control function that steers the
state point from a given initial subset to a given target
subset of the state space, minimizing a pre-assigned cost
functional in the process. The control problems that arise
while obtaining mathematical models of real world physical
problems are mostly non-linear in character. For these
problems, an attempt to obtain the optimal control in an
analytic form will, in general, lead us nowhere. The next best
thing to do, therefore, is to gather information about the
nature of the optimal control function.

If one looks at the solved problems in the literature on
optimal control theory, the first thing one notices is that,
in general, the optimal control suffers discontinuities while
steering the state point from the initial position to the
final one, giving rise to corners in the optimal trajectory in
the process. In cases where the optimal control escapes
being discontinuous, one can almost always make it so by
moving the boundary points farther apart. This is very
interesting (to our way of thinking). Furthermore, it is not
all that easy to gain an intuitive understanding of this
tendency to become discontinuous. For, one may be lured
into thinking that an endeavour that accomplishes a set goal
with a minimum effort has to be a smooth one. The purpose of
this work is to glean as much information as we can about the
discontinuities of the optimal control.

At the outset we ask the following two questions:

- What makes the optimal control suffer discontinuities?

- How many are they?

The latter question as stated above is, however, a bit
ambiguous. For, one knows that stretching the boundary points
farther apart is very likely to result in a larger number of
discontinuities of the optimal control. A precise form of
the latter question is as follows:

Given a set of boundary conditions, how many
discontinuities does the optimal control suffer
while steering the state from the initial to the target point?

The question thus formulated embodies some inherent,
insurmountable difficulties. For, an attempt to answer it
will carry us into the deep and until now unfathomable waters
of the two point boundary value problem. We will therefore ask
a question which is not dependent on the boundary points.

Which conditions, we ask, on the system to be
controlled put an upper bound on the number of discontinuities
of the optimal control, no matter how far we travel along any
optimal trajectory from any point in the state space?

In seeking an answer to the above question, we do
not propose to exhaust all possibilities. We will not be
able to. When the state equations are non-linear, we
believe, there are many ways of putting an upper bound on
the number of discontinuities of the optimal control. In
our attempt to find conditions for an upper bound we will
take inspiration from the results due to A. A. Fel'dbaum.
Consider the time optimal control problem for linear systems
of the form

$$\dot{x} = Ax + bu, \qquad -1 \le u \le 1, \tag{L}$$

where $x \in \mathbb{R}^n$, $A$ is an $n \times n$ real matrix and $b$ is an $n \times 1$ real
vector. The optimal control has been found to be piecewise
constant; it takes on the values $+1$ and $-1$ alternately,
suffering discontinuities in the process. Fel'dbaum [1, p.120;
2, p.143] has shown that in systems where $A$ has all real
eigenvalues, the optimal control does not suffer more than
$(n-1)$ discontinuities no matter how far the final point
lies from the initial point. The above linear problem is
a particular case of the more general non-linear problem we
propose to investigate and our aim is to explore the
possibilities for obtaining similar results for the
non-linear problem.

Optimal control theory, as its history shows,
has arisen out of problems that are of interest in various
branches of science and engineering. The brachistochrone
problem led to the foundation of the calculus of
variations. More recently Lawden's work [7] on the
optimal navigation of a spacecraft from one orbit to another
led to the singular optimal control problem. Our own
interest in the discontinuities of the optimal control also
stems from Lawden's problem. We want to know whether
there exists a bound on the number of impulses used in the
optimal orbital transfer (Lawden [7], Marchal [9], Small [10],
Edelbaum [20]). This problem, however, will not be taken up
in this work.

0-2 Method of Investigation 

Before looking at how we propose to get on with the
analysis, let us say something about the tools we are going
to use. Now, a standard assumption in modern control theory
is that the state space is a vector space. This is very
natural and valid in many situations. However, as pointed
out by Brockett [11], there are very natural control problems
in engineering and physics where the state space fails to
qualify as a vector space. These problems are therefore not
treated by classical control theory. To obviate this
difficulty we take the state space in our analysis to be a
smooth manifold and use the methods of differential geometry
for the investigation. A smooth manifold is only a
generalization of $\mathbb{R}^n$, in the sense that, locally, it looks like
$\mathbb{R}^n$, in which it is possible to define the process of
differentiation. Moreover, this generalization in no
way complicates the nature of the control problem. On the
other hand, the methods of differential geometry, being compact,
give an easy intuitive feeling for the dynamical behaviour of
the system under consideration. With regard to the theory
of non-linear systems, it is now an established fact that
modern differential geometry is uniquely suited to its
development [12]. Recent work in geometric control theory
by Brockett, Elliott, Krener, Sussmann and others attests to
the usefulness of geometric methods [8].

In this work we adhere to the terminology and
definitions as given by Michael Spivak in his delightful book
on differential geometry [3].

0-3 A View of the Analysis and a Review of the Literature

We propose to consider the time optimal control
problem for non-linear systems of the form

$$\frac{dx}{dt} = f(x,u), \tag{0.1}$$

where $x$ belongs to an $n$-dimensional smooth manifold $M$, $f$ is
well defined for our purpose and $u$ is a scalar control with
$a \le u \le b$ for some real $a$ and $b$. The investigation is
planned in three stages.

Looking back at Fel'dbaum's switching theorem for
linear systems we see that the bound on the number of
discontinuities, when it exists, depends upon the dimension
of the state space. Why does this dimensionality influence
the number of discontinuities? In the first stage of our
analysis we answer this question. The answer lies in
recognizing the fact that when the state space $M$ is paracompact
and $n > 1$, the set of points reachable from some $p \in M$ along
the integral curve of a single vector field defined on $M$
does not possess an interior. To endow the reachable set
from $p$ with an interior one has to have more than one vector
field defined on $M$ and be able to switch from the integral
curve of one vector field to that of another. We show that
when the totality of vector fields at our disposal is
countable (it is so in Fel'dbaum's case; in fact there are
only two vector fields to manoeuvre with) the upper bound,
if it exists, on the number of such switchings, required to
make the reachable set possess a non-empty interior, is a
number not less than $(n-1)$. And this is where the
dimensionality of the state space comes into the picture.

That the dimensionality of the state space is a
governing factor in choosing a family $D$ of vector fields on
$M$, such that the set of points $D$-reachable from $p \in M$ (i.e.,
points in $M$ reachable from $p$ along the integral curve of
some vector field in $D$ or along a concatenation of such
integral curves; going backwards in time not being allowed)
possesses a non-empty interior, has been shown by Sussmann and
Jurdjevic [13] in a comprehensive paper on controllability.
In a later paper, Krener [14] generalizes some of these
results. In yet another paper Levitt and Sussmann [15] have
shown that on every connected paracompact manifold $M$ of class
$C^k$, $k \ge 2$, there exists a set $S$ consisting of two $C^{k-1}$ vector
fields on $M$ such that the set of points $S$-reachable from any
$p \in M$ is the entire manifold $M$ and that it is possible to
have a bound on the number of switchings. In this context
results by Sussmann on local controllability are also
interesting.

In the second stage we propose to build a proper
framework to facilitate the analysis of the discontinuities
of the optimal control. To this end we exploit Pontryagin's
maximum principle. Pontryagin's principle converts the optimal
control problem into a problem in Hamiltonian control systems
by making it necessary to define an $n$-dimensional co-state
vector $\lambda$. Now Hamiltonian control systems are constructed
on the cotangent bundle of the state space. We therefore take
the cotangent bundle of $M$ rather than $M$ itself as our base of
operation. This immediately requires the representation of
the optimal control as a function on the cotangent bundle in
the form $(x,\lambda) \to u(x,\lambda)$. In this context brief comments by
Sussmann in one of his papers [17] are quite interesting.
In the process of obtaining the optimal control in the form
$(x,\lambda) \to u(x,\lambda)$, we also find out the reasons which compel the
optimal control to suffer discontinuities and
non-differentiabilities.

In the third stage we take up the question of putting
an upper bound on the number of discontinuities of the
optimal control. As far as we know, except for Fel'dbaum's
switching theorem there is little literature on optimal
control theory that has direct bearing on this aspect. Hajek
in his paper [18] extends Fel'dbaum's theorem. He
considers linear problems of the form (L) with no restrictions
on the eigenvalues of $A$ and shows that for every such problem
there exists a fundamental length $\varepsilon > 0$ with the property that
the optimal control in any time interval of length less than
$\varepsilon$ suffers at most $(n-1)$ discontinuities. In [19] Sussmann
studies systems of the form

$$\dot{x} = f(x) + u\,g(x).$$

He proves a bang-bang theorem with a priori bounds on the
number of switchings.

In our analysis, we actually put a bound on the total
number of discontinuities and non-differentiabilities of the
optimal control. We show that when System (0.1) satisfies a
certain geometric condition called the transversality
condition, the total number of discontinuities and
non-differentiabilities suffered by the optimal control governing
any optimal trajectory in $M$ does not exceed $(n-1)$ no matter
how long the time interval is. Finally, we prove that the
Linear System (L) satisfies the transversality condition
when $A$ has all real eigenvalues.



CHAPTER 1 


PREPARATORY ANALYSIS 
1-1 Problem Statement

We consider the time optimal control problem for
nonlinear systems of the form

$$\frac{dx(t)}{dt} = f(x(t),u(t)). \tag{1.1}$$

Here $x$ belongs to an $n$-dimensional smooth ($C^\infty$) manifold $M$
and the control $u$ belongs to a class $U$ of scalar admissible
controls. We define $U$ as follows:

If $v \in U$ then

(i) $v : I \to \Omega \subset \mathbb{R}$ is piecewise continuous, where $I \subset [0,\infty)$
is an interval and $\Omega = [a,b]$.

(ii) The discontinuities of $v$ are of the first kind, i.e.,
we assume the existence of the limits

$$v(\tau-0) = \lim_{t \to \tau,\ t < \tau} v(t) \quad \text{and} \quad v(\tau+0) = \lim_{t \to \tau,\ t > \tau} v(t)$$

at a point of discontinuity $\tau$.

To avoid confusion, we take the value of $v$ at a point of
discontinuity to be its right hand limit, i.e.,

$$v(\tau) = v(\tau+0).$$

The mapping

$$f : M \times \Omega \to TM$$

($TM$ is the tangent bundle of $M$) is twice continuously
differentiable with respect to $x$ and $u$.

Given a control function $t \to u(t)$ defined on an
interval $I$, a trajectory governed by $u(t)$ is a curve
$x : I \to M$ that satisfies

$$\dot{x}(t) = f(x(t), u(t))$$

for almost all $t \in I$. Given the System (1.1) with an initial
point $x \in M$ and a target point $y \in M$, we say that the control
$u : [0,T] \to \Omega$ steers the system from $x$ to $y$, in time $T$, if
there exists a trajectory $x : [0,T] \to M$ governed by it which
satisfies $x(0) = x$ and $x(T) = y$. Furthermore, of all the
admissible controls steering the system from $x$ to $y$, the
control $u$ which minimizes the transit time will be called
the optimal control and we assume that it is unique. The
trajectory governed by an optimal control will be called an
optimal trajectory.

As indicated in the introduction, our aim is to study
the discontinuities of the optimal control.

1-2 Dependence of the Number of Discontinuities on the 
Dimensionality of M. 

In this section we study
the dependence, if any, of the number of discontinuities of
the optimal control on the dimensionality of the state space
$M$. For this purpose we choose a special system from among
the systems of the form (1.1). We describe this system
below.

First of all we assume $M$ to be paracompact. Let $y \in M$
and let $M(y) \subset M$ be the set of all points in $M$ from which
there exist optimal trajectories to $y$. Next we assume that
for every $y \in M$ there exists a function $v_y(x)$,

$$v_y : G(y) \to \mathbb{R},$$

defined on some open subset $G(y) \subset M$ such that $G(y) \supset M(y)$
and $v_y$, when restricted to $M(y)$, acts as the synthesizing
function with respect to the target $y$. In other words the
integral curve of the equation

$$\frac{dx}{dt} = f(x, v_y(x)) \tag{1.2}$$

from any point $x \in (M(y) - \{y\})$ yields the optimal trajectory
from $x$ to $y$. In general $v_y(x)$ is a discontinuous function.
We assume that it is differentiable everywhere except,
of course, at the points of discontinuity. The number of
discontinuities that $v_y(x)$ suffers while steering the System
(1.2) from $x$ to $y$ is equal to the number of discontinuities
of the optimal control $u(t)$, which governs the optimal
trajectory from $x$ to $y$, in the optimal time interval. We
therefore focus our attention on the discontinuities of
$v_y(x)$ instead.
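
A concrete synthesizing function may help fix ideas. The sketch below is an illustration of ours and not part of the analysis: it assumes the classical double integrator $\dot{x}_1 = x_2$, $\dot{x}_2 = u$, $|u| \le 1$, on $M = \mathbb{R}^2$ with target $y = 0$, for which the synthesizing function is known in closed form. The function names (`synthesis`, `optimal_segments`, `apply_segments`) are hypothetical. The control jumps at most once, which agrees with $n - 1 = 1$.

```python
import math

def synthesis(x1, x2):
    """Synthesizing function v_0(x) for the double integrator (assumed example)."""
    s = x1 + x2 * abs(x2) / 2.0          # s = 0 is the switching curve
    if s > 0:
        return -1.0
    if s < 0:
        return 1.0
    return -math.copysign(1.0, x2) if x2 != 0 else 0.0

def optimal_segments(x1, x2):
    """Time-optimal control to the origin as a list of (u, duration) pieces."""
    s = x1 + x2 * abs(x2) / 2.0
    if s == 0.0:                          # already on the switching curve
        return [(-math.copysign(1.0, x2), abs(x2))] if x2 != 0 else []
    sign = 1.0 if s > 0 else -1.0         # reduce the case s < 0 to s > 0 by symmetry
    y1, y2 = sign * x1, sign * x2
    r = math.sqrt(y1 + y2 * y2 / 2.0)
    return [(-sign, y2 + r), (sign, r)]   # one switching, on the curve s = 0

def apply_segments(segments, x1, x2):
    """Integrate the double integrator exactly over piecewise constant controls."""
    for u, t in segments:
        x1, x2 = x1 + x2 * t + u * t * t / 2.0, x2 + u * t
    return x1, x2

if __name__ == "__main__":
    start = (3.0, 2.0)
    segs = optimal_segments(*start)
    print("segments:", segs)
    print("end point:", apply_segments(segs, *start))
    print("discontinuities of the control:", len(segs) - 1)
```

The number of discontinuities of $v_0(x)$ along the trajectory equals the number of pieces minus one, which is the quantity studied in the remainder of this section.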

It is useful to interpret the discontinuities
of $v_y(x)$ in the following way. As we start from $x$ and proceed
along the optimal trajectory, $v_y(x)$ varies smoothly until a
point of discontinuity is reached, where it undergoes a jump
and then varies smoothly again. In other words $v_y(x)$ switches
from one differentiable function to another, suffering a
discontinuity in the process, and this process possibly gets
repeated a number of times before $y$ is reached.

More precisely, let $D(M)$ be the set of all differentiable
functions defined on $M$. We assume that there exists a
partition of $(M(y) - \{y\})$ by the partition sets $\{S_i\}$, say, such
that

(i) the sets $S_i$ are connected and for each $S_i$ it is possible to find
a function $v_y^i(x) \in D(M)$ such that

$$v_y^i(x) = v_y(x) \quad \forall x \in S_i,$$

(ii) and the sets $S_i$ are maximal with respect to the above
property, i.e., there does not exist a connected subset $\sigma_i$
of $(M(y) - \{y\})$ and a function $h(x) \in D(M)$ with the
property that $\sigma_i \supsetneq S_i$ and

$$h(x) = v_y(x) \quad \forall x \in \sigma_i.$$

We denote by $D_y(M)$ the collection of all functions $\{v_y^i(x)\}$.
(Note: For an $S_i$ it is possible to find more than one
function belonging to $D(M)$ which can serve as $v_y^i(x)$. However,
in $D_y(M)$ we include one differentiable function for each $S_i$.)

The functions $v_y(x)$ are different for different $y \in M$,
and so are the subsets $D_y(M)$. We further assume that
there exists a countable subset $D'(M) \subset D(M)$ such that
$D_y(M) \subset D'(M)$ for all $y \in M$. For each $g(x) \in D'(M)$ the
mapping $f(x, g(x))$ gives rise to a vector field on $M$. The
collection $D'(M)$ therefore gives rise to a countable family
of $C^1$ vector fields on $M$. We denote this family by $V$ and by
$V_y$ we denote the family of vector fields corresponding to the
functions in $D_y(M)$.

The above discussion describes the special case of the
System (1.1) we have chosen for analysis in this section.
We denote this system by $S$. $S$ is roughly a subsystem of the
family of vector fields $V$ defined on $M$. With this in mind we
define a few terms.

Definition 1

An integral curve of $V$ is a mapping $\alpha$ from a real
interval $[t, t']$ into $M$ such that there exists a division
$t = t_0 < t_1 < \dots < t_k = t'$ and elements $X_1, \dots, X_k$ of $V$
with the property that the restriction of $\alpha$ to $[t_{i-1}, t_i]$ is
an integral curve of $X_i$ for each $i = 1, 2, \dots, k$.

Definition 2

Let $x \in M$. A point $y \in M$ is $V$-reachable from $x$ if
there exists $T > 0$ and an integral curve $\alpha : [0,T] \to M$ of $V$
such that $\alpha(0) = x$ and $\alpha(T) = y$. Let the set of all points
$V$-reachable from $x$ be denoted by $L_x(V)$.

Definition 3

Let $\alpha : [t, t'] \to M$ be an integral curve of $V$. We say
that $\tau \in (t, t')$ is a switching point of $\alpha$ if $\alpha$ is an integral
curve of $X_i \in V$ in $(\tau-\delta, \tau]$ and an integral curve of $X_j \in V$
in $[\tau, \tau+\varepsilon)$ for some $\delta, \varepsilon > 0$ and $X_i \ne X_j$.

Definition 4

Let $I^m$ denote the $m$-tuple of integers $(i_1, i_2, \dots, i_m)$.
We say that a point $y \in M$ is $V$-reachable from $x$ with the
switching mode $I^m$ if there exist vector fields $X_{i_k} \in V$,
$k = 1, 2, \dots, m$, an integral curve $\alpha : [0,T] \to M$ of $V$ for some
$T > 0$ and a division $0 = t_0 < t_1 < \dots < t_m = T$, such that
$\alpha(0) = x$, $\alpha(T) = y$ and $\alpha$, when restricted to the interval
$[t_{k-1}, t_k]$, is an integral curve of the vector
field $X_{i_k}$.

Definition 5

Let $x \in M$ and let $m$ be an integer. A point $y \in M$ is said
to be $V$-reachable from $x$ with $m$ switchings if there exists
an integral curve $\alpha : [0,T] \to M$ of $V$ such that $\alpha(0) = x$,
$\alpha(T) = y$ and the interval $(0,T)$ contains $m$ switching points
of $\alpha$.

Given a vector field $X_i \in V$, we denote by $\phi_i(t, \cdot)$ the
one-parameter group of diffeomorphisms generated by $X_i$
[3, p.5-21]. The following theorem is central to the
analysis in this chapter.

Theorem 1

Let $V$ be a countable set of $C^1$ vector fields defined
on an $n$-dimensional manifold $M$. Let $x \in M$ and let $L_x(V)$ have
a non-empty interior. If there exists an integer $m$ such that
all the points in $L_x(V)$ are $V$-reachable from $x$ with less than
or equal to $m$ switchings, then $m \ge n-1$.


Proof. Suppose that the integer $m$ exists. For an integer $i$
we denote by $L_x(V,i)$ the set of all points in $M$ that are $V$-reachable
from $x$ with $i$ switchings. It should be noted that a point
$y \in M$ can be reached from $x$ along different integral curves
of $V$ with different numbers of switchings. The sets $L_x(V,i)$,
$i = 1, 2, \dots$, are therefore not necessarily all disjoint.
Clearly

$$L_x(V) = \bigcup_{i=0}^{m} L_x(V,i).$$

$L_x(V)$ has a non-zero measure [5, p.33]. It therefore follows
that there exists an integer $k$, $0 \le k \le m$, for which $L_x(V,k)$
has a non-zero measure. Let $I^{k+1}$ denote the $(k+1)$-tuple of
integers $(i_1, i_2, \dots, i_{k+1})$. We denote by $L_x(V,k,I^{k+1})$ the
set of all points in $M$ that are $V$-reachable from $x$ with the switching
mode $I^{k+1}$ (Def. 4). Clearly

$$L_x(V,k,I^{k+1}) \subset L_x(V,k).$$

Moreover the set $L_x(V,k,I^{k+1})$ corresponds to the collection
$\{X_{i_1}, X_{i_2}, \dots, X_{i_{k+1}}\}$ of $(k+1)$ vector fields belonging to $V$,
in the order given. We note that there is no need for any
two of the integers $i_1, i_2, \dots, i_{k+1}$, except when they are
consecutive, to be different. Now $V$ contains a countable
number of vector fields. It is therefore possible to have
only a countable number of different collections of vector
fields where each collection consists of $(k+1)$ vector fields
(not all different) belonging to $V$. Hence there are only a
countable number of sets such as $L_x(V,k,I^{k+1})$ and their
union is $L_x(V,k)$, which has a non-zero measure. We therefore
conclude that there exists a switching mode $J^{k+1}$ corresponding
to the $(k+1)$-tuple $(j_1, j_2, \dots, j_{k+1})$ such that the set
$L_x(V,k,J^{k+1})$ has a non-zero measure, where

$$L_x(V,k,J^{k+1}) \subset L_x(V,k) \subset L_x(V) \subset M.$$

The points of $L_x(V,k,J^{k+1})$ are of the form

$$\phi_{j_{k+1}}(t_{k+1}, \phi_{j_k}(t_k, \dots, \phi_{j_1}(t_1, x) \dots))$$

for some positive real $t_1, t_2, \dots, t_{k+1}$. Let the subset
$A \subset \mathbb{R}^{k+1}$ be such that

(i) $(t_1, t_2, \dots, t_{k+1}) \in A \implies t_1 > 0, t_2 > 0, \dots, t_{k+1} > 0$,

(ii) $A$ consists of all points in $\mathbb{R}^{k+1}$ at which the function
$P$ given by

$$P(t_1, t_2, \dots, t_{k+1}) = \phi_{j_{k+1}}(t_{k+1}, \phi_{j_k}(t_k, \dots, \phi_{j_1}(t_1, x) \dots))$$

is defined.

Clearly $A$ is an open subset of $\mathbb{R}^{k+1}$ and

$$P(A) = L_x(V,k,J^{k+1}).$$

Since the vector fields $X_{j_1}, X_{j_2}, \dots, X_{j_{k+1}}$ are all
$C^1$-differentiable, the mapping $P : A \to M$ is also $C^1$-differentiable.
$P$ maps a submanifold of $\mathbb{R}^{k+1}$ onto a subset of $M$ having a
non-zero measure. We therefore conclude that

$$k+1 \ge n \implies k \ge n-1 \implies m \ge n-1.$$

(Let $X$ and $Y$ be differentiable manifolds of dimensions $n$ and
$m$ respectively with $n < m$. Let $f : X \to Y$ be $C^1$-differentiable;
then $f(X)$ has measure zero in $Y$. See [5, p.31] for a proof.)
This completes our proof.

Now a few remarks concerning this theorem:

Remark 1. Let $D$ denote a set of complete analytic vector
fields defined on $M$. A vector field $X$ is complete if the
integral curves of $X$ are defined for all real $t$. Let $T(D)$
be the smallest Lie algebra of analytic vector fields on $M$
which contains $D$ (the bracket operation being the Lie product
of vector fields). Sussmann and Jurdjevic [13] have shown
that a necessary and sufficient condition for $L_x(D)$ to have a
non-empty interior in $M$ is that $\dim T(D)(x) = \dim M$, where
$T(D)(x)$ is the subspace of the tangent space $M_x$ spanned by the vectors $X(x)$,
$X \in T(D)$.
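
The rank condition of Remark 1 is easy to check for a pair of affine vector fields. The sketch below is an illustration of ours, under the assumption that $D = \{Ax + b,\ Ax - b\}$ on $\mathbb{R}^n$. The difference of the two fields is the constant field $2b$, and the Lie product of $Ax + b$ with a constant field $c$ is, up to sign, the constant field $Ac$. Hence $T(D)(x)$ contains $b, Ab, \dots, A^{n-1}b$ at every $x$, and if these vectors span $\mathbb{R}^n$ the condition $\dim T(D)(x) = n$ holds. The helper name `lie_rank_lower_bound` is hypothetical.

```python
import numpy as np

def lie_rank_lower_bound(A, b):
    """Rank of span{b, Ab, ..., A^(n-1) b}, a lower bound for dim T(D)(x)
    when D = {Ax + b, Ax - b} (assumed example, see the text above)."""
    A = np.asarray(A, dtype=float)
    b = np.asarray(b, dtype=float)
    n = A.shape[0]
    cols = [b]
    for _ in range(n - 1):
        cols.append(A @ cols[-1])        # next iterated bracket, up to sign
    return int(np.linalg.matrix_rank(np.column_stack(cols)))

if __name__ == "__main__":
    # Triple integrator: the bound equals n = 3, so the rank condition holds.
    A = [[0, 1, 0], [0, 0, 1], [0, 0, 0]]
    b = [0, 0, 1]
    print(lie_rank_lower_bound(A, b))
```

When the returned value is smaller than $n$ the sketch is inconclusive only in the sense that it is a lower bound; for this particular family the span of the vectors $A^k b$ together with the value $Ax$ of the drift at the point would have to be examined.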

Remark 2. It is clear that the above theorem revolves around
the fact that $L_x(V)$ has a non-empty interior. By assuming
this we have ensured that the reachable set has an
$n$-dimensional freedom. That this dimensional extension gives
rise to the necessity of at least $(n-1)$ switchings can be
realized intuitively also. For, were we to allow no switchings,
the reachable set from the point $x$ will clearly be one of a
single dimension (excluding the point $x$), being the countable
union of smooth integral curves of different members of $V$
through the point $x$; and if one switching were allowed, the
reachable set would at best be of two dimensions. Switchings
are therefore necessary to increase the dimension of the
reachable set.
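
The dimension count of Remark 2 can be observed numerically. The sketch below is ours and rests on assumptions that are not part of the text: the state space is $\mathbb{R}^3$, the two vector fields are those of the triple integrator $\dot{x}_1 = x_2$, $\dot{x}_2 = x_3$, $\dot{x}_3 = u$ with $u = +1$ and $u = -1$, the starting point is the origin, and the fields alternate beginning with $u = +1$. It estimates by central differences the rank of the Jacobian of the endpoint map $P(t_1, \dots, t_{k+1})$ used in the proof of Theorem 1. The rank cannot exceed $k+1$, so fewer than $n-1 = 2$ switchings cannot give an image with interior.

```python
import numpy as np

def flow(u, t, x):
    """Exact flow of the triple integrator for a constant control u over time t."""
    x1, x2, x3 = x
    return np.array([
        x1 + x2 * t + x3 * t**2 / 2 + u * t**3 / 6,
        x2 + x3 * t + u * t**2 / 2,
        x3 + u * t,
    ])

def endpoint(times, x0=(0.0, 0.0, 0.0)):
    """Endpoint map P(t_1, ..., t_{k+1}); controls alternate +1, -1, +1, ..."""
    x = np.array(x0, dtype=float)
    for j, t in enumerate(times):
        x = flow(1.0 if j % 2 == 0 else -1.0, t, x)
    return x

def jacobian_rank(times, h=1e-6, tol=1e-6):
    """Numerical rank of dP/dt at the given switching times."""
    times = np.array(times, dtype=float)
    cols = []
    for j in range(len(times)):
        step = np.zeros_like(times)
        step[j] = h
        cols.append((endpoint(times + step) - endpoint(times - step)) / (2 * h))
    return int(np.linalg.matrix_rank(np.column_stack(cols), tol=tol))

if __name__ == "__main__":
    for k in range(3):                    # k = number of switchings
        print(k, "switchings -> rank", jacobian_rank([1.0] * (k + 1)))
```

At the sample point chosen here the rank grows by one with each additional switching and reaches $n = 3$ only when $k = 2 = n - 1$.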

The optimal trajectories of the system $S$ are nothing
but integral curves of $V$. While proving the theorem above, we
had the freedom to switch from any member of $V$ to any other
member at any instant. Let this type of switchings be called
"free switchings".

Consider the optimal problem for the system $S$. In an
attempt to reach a point $y$ from $x$ ($x, y \in M$) optimally, we
cannot afford free switchings, for we have to satisfy the
constraints of optimality. Let $\alpha : [0,T] \to M$ be the optimal
trajectory from $x$ to $y$ for the system $S$. $\alpha$ is an integral
curve of $V$, in particular of $V_y$, and $[0,T]$ is the optimal
time interval. Let the function $u(t) : [0,T] \to \mathbb{R}$ be the
governing optimal control. Now it may be possible to construct
$\alpha$ in different ways using different subsets of $V$. However,
considering the way $V_y$ is constructed, there exists a way of
constructing $\alpha$ using vector fields belonging to $V_y$ such that
every switching point $\tau \in (0,T)$ of $\alpha$ is a point of
discontinuity of $u(t)$. We consider all optimal trajectories of the
system $S$ to have been constructed in this way and by "optimal
switchings" we mean the switchings encountered along an
optimal trajectory. Optimal switchings are therefore a
particular case of free switchings. We have therefore proved
the following corollary.

Corollary

Given any $x \in M$ let the set of points reachable from $x$
along optimal trajectories of $S$ have a non-empty interior.
Let $N$ be the set of integers such that $k \in N$ if and only if there exists
an optimal trajectory of $S$ along which the optimal control
suffers $k$ discontinuities, where we take into account all
possible optimal trajectories of the system $S$ while
constructing $N$. Then the least upper bound of the set $N$, if it exists,
is a number greater than or equal to $(n-1)$, where $n$ is the
dimension of the state space $M$.

The above analysis also explains the reason why the
optimal control for the system $S$ suffers discontinuities.
Discontinuities are necessary to increase the dimensionality
of the optimally reachable set.

To illustrate the significance of the corollary let us
consider the linear time optimal problems of the form

$$\dot{x} = Ax + bu, \qquad -1 \le u \le 1. \tag{L}$$

Here $x \in \mathbb{R}^n$, $A$ is an $n \times n$ real matrix and $b$ an $n \times 1$ real vector.
We assume (L) is controllable. An assertion about the
number of discontinuities of the optimal control is spelled
out in Fel'dbaum's switching theorem [1, p.120]. We state
it as follows:

Let all the eigenvalues of $A$ be real. Then the optimal
control is piecewise constant, takes on only the values $+1$ and
$-1$, and does not have more than $(n-1)$ discontinuities.

If, on the other hand, $A$ has complex eigenvalues it is
not possible to put a bound on the number of discontinuities
[18].
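
The contrast between real and complex eigenvalues can be seen on two small examples. The sketch below is ours and assumes the standard form of the maximum principle for (L): the co-state satisfies $\dot{\lambda} = -A^T \lambda$ and the extremal control is $u(t) = \mathrm{sign}\,\langle \lambda(t), b \rangle$, so that the control jumps where the switching function $s(t) = \lambda_0^T e^{-At} b$ changes sign. The two matrices, the initial co-states and the function names are hypothetical choices for illustration, and $A$ is assumed diagonalizable.

```python
import numpy as np

def switching_function(A, b, lam0, ts):
    """Evaluate s(t) = lam0^T exp(-A t) b on the grid ts (A assumed diagonalizable)."""
    w, V = np.linalg.eig(np.asarray(A, dtype=float))
    coeff_b = np.linalg.solve(V, np.asarray(b, dtype=complex))   # V^{-1} b
    coeff_l = np.asarray(lam0, dtype=complex) @ V                # lam0^T V
    # s(t) = sum_k coeff_l[k] * exp(-w[k] * t) * coeff_b[k]
    return np.real(np.exp(-np.outer(ts, w)) @ (coeff_l * coeff_b))

def count_sign_changes(values, tol=1e-12):
    """Number of sign changes along the grid, ignoring values within tol of zero."""
    signs = np.sign(values[np.abs(values) > tol])
    return int(np.sum(signs[1:] != signs[:-1]))

if __name__ == "__main__":
    ts = np.linspace(0.0, 20.0, 4001)
    b = [0.0, 1.0]
    real_case = [[0.0, 1.0], [-2.0, -3.0]]      # eigenvalues -1 and -2
    oscillator = [[0.0, 1.0], [-1.0, 0.0]]      # eigenvalues +i and -i
    print(count_sign_changes(switching_function(real_case, b, [1.0, -1.0], ts)))
    print(count_sign_changes(switching_function(oscillator, b, [1.0, 0.0], ts)))
```

With real eigenvalues $s(t)$ is a combination of $n$ real exponentials and so has at most $n-1$ sign changes; for the oscillator $s(t)$ is sinusoidal and the count grows with the length of the time interval.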

The optimal trajectories of (L) are nothing but integral
curves of $V = \{Z_1, Z_2\}$, where $Z_1$ and $Z_2$ are vector fields defined
on $\mathbb{R}^n$ by the rule

$$Z_1(x) = Ax + b \quad \text{and} \quad Z_2(x) = Ax - b.$$

Controllability of (L) ensures that, given $x \in \mathbb{R}^n$, the set of
points reachable from $x$ along optimal trajectories of (L) has a
non-empty interior; in fact the set is equal to $\mathbb{R}^n$. The
above corollary, now applicable to (L), explains the dependence
of Fel'dbaum's theorem on the dimension of the state space,
in particular the dependence on the number $(n-1)$. A number
less than $(n-1)$ can never be an upper bound and it so happens
that, when the eigenvalues of $A$ are all real, a point $y \in \mathbb{R}^n$
is $V$-reachable from $x$ with no more than $(n-1)$ switchings [18].

A Theorem on Free Switchings

In the wake of Theorem 1 the following question arises
naturally. Is it possible to find a countable family $V$ of
vector fields on $M$ such that all the points in $L_x(V)$ are
$V$-reachable from $x$ with no more than $(n-1)$ switchings? In
the discussion for linear systems we saw that if $V = \{Z_1, Z_2\}$
and all the eigenvalues of $A$ are real, then $V$ possesses this
property. We also believe there are many ways of choosing
a countable collection possessing the above property from
the set of all non-linear vector fields defined on $M$. One
such collection, for example, is the set $V$ whose elements give
rise to a co-ordinate system on $M$. We conclude this chapter
with a theorem which proves this intuitively obvious fact.

Theorem 2

Let $M$ be an $n$-dimensional analytic manifold. Let $V$ be
the set of $n$ vector fields $X_1, X_2, \dots, X_n$ defined on $M$ which
satisfy the following conditions:

(i) $X_i$ are analytic and complete, $i = 1, 2, \dots, n$.

(ii) $X_1, \dots, X_n$ are linearly independent everywhere on $M$.

(iii) $[X_i, X_j] = 0$ on $M$ for $1 \le i, j \le n$.

Then a point $y \in L_x(V)$ is $V$-reachable from $x$ with no more
than $(n-1)$ switchings.

Proof. We note that $L_x(V)$ has a non-empty interior for all
$x \in M$ [13]. Condition (iii) further ensures the commutativity
of the vector fields in V. If $X_i$ and $X_j$ belong to V and
generate the one-parameter groups of diffeomorphisms $\Phi_i(t, \cdot)$
and $\Phi_j(t, \cdot)$ respectively, then for any $x \in M$

$$\Phi_i(t, \Phi_j(s, x)) = \Phi_j(s, \Phi_i(t, x)) \quad \text{for all } t, s \in \mathbb{R}. \qquad (*)$$

Let $y \in L_x(V)$. Then there is an integral curve $t \to \alpha(t)$
of V which connects x to y. Let this integral curve be of
the form

$$x = \alpha(0),$$

$$y = \alpha(t_1 + t_2 + \cdots + t_k) = \Phi_{i_k}(t_k, \Phi_{i_{k-1}}(t_{k-1}, \ldots, \Phi_{i_1}(t_1, x) \ldots)),$$

corresponding to the vector fields $X_{i_1}, X_{i_2}, \ldots, X_{i_k}$ in V.
If $X_{i_1} = X_{i_m}$, $1 < m \le k$, then from Equation (*) it follows
that there exists an integral curve $\alpha_1$ of V connecting x to
y, which has the form

$$x = \alpha_1(0),$$

$$y = \alpha_1(t_1 + t_2 + \cdots + t_k) = \Phi_{i_k}(t_k, \ldots, \Phi_{i_{m+1}}(t_{m+1}, \Phi_{i_{m-1}}(t_{m-1}, \ldots, \Phi_{i_1}(t_1 + t_m, x) \ldots)) \ldots).$$

Applying the above technique repeatedly we can therefore find
an integral curve $\alpha$ of V connecting x to y such that

(a) $\alpha(0) = x$, $\alpha(t_1 + t_2 + \cdots + t_k) = y$, and

(b) given $X_i \in V$, $\alpha$ is an integral curve of $X_i$ in a connected
subinterval of $[0, t_1 + t_2 + \cdots + t_k]$.

However, as there are only n vector fields in V, it follows that
$\alpha$ can contain at most (n-1) switchings. This concludes the
proof and the chapter.
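The rearrangement used in the proof can be illustrated numerically. The sketch below assumes the simplest commuting family, the constant coordinate fields $X_i = \partial / \partial x_i$ on $\mathbb{R}^n$, whose flows are translations. The function names (`flow`, `follow`, `merge_segments`, `switchings`) are illustrative only and do not come from the text.

```python
from typing import Dict, List, Tuple

Segment = Tuple[int, float]  # (index i of the field X_i, time spent on it)


def flow(i: int, t: float, x: List[float]) -> List[float]:
    """Flow of the assumed constant field X_i = d/dx_i: shift coordinate i by t."""
    y = list(x)
    y[i] += t
    return y


def follow(segments: List[Segment], x: List[float]) -> List[float]:
    """Follow a concatenation of integral curves, segment by segment."""
    for i, t in segments:
        x = flow(i, t, x)
    return x


def merge_segments(segments: List[Segment]) -> List[Segment]:
    """Use commutativity (Equation (*)) to collect all the time spent on each
    field into a single segment, keeping the order of first appearance."""
    total: Dict[int, float] = {}
    for i, t in segments:
        total[i] = total.get(i, 0.0) + t
    return list(total.items())


def switchings(segments: List[Segment]) -> int:
    """Number of switchings, assuming consecutive segments use different fields."""
    return max(len(segments) - 1, 0)
```

For n = 3, the curve `[(0, 1.0), (1, 0.5), (0, 2.0), (2, 0.25), (1, 1.5)]` has four switchings; after `merge_segments` it reaches the same end point with two, that is, (n-1).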



CHAPTER 2 
CONTROL REGIONS 

2-1 Preliminaries

In this chapter we make some geometric constructions
with a view to facilitating the interpretation of discontinuities
of the optimal control. We consider the time optimal control
problem for systems

$$\frac{dx}{dt} = f(x(t), u(t)) \qquad (2.1)$$

as defined in Section (1-1).

As f(x(t),u(t)) is nonlinear, finding the optimal
control $t \to u(t)$ and the corresponding optimal trajectory
for the System (2.1) amounts to solving a two point
nonlinear boundary value problem. We will not attempt anything
of this kind. Given the initial and the target points in the
state space M, we assume that a unique optimal control exists
for the system and therefore satisfies the necessary conditions
of Pontryagin's maximum principle. We exploit this principle
to our advantage.

The maximum principle first of all requires defining
a costate variable as follows:

Let (x, V) be a co-ordinate system around a point $p \in M$.
Consider the mapping $f : M \times \Omega \to TM$. Let $\alpha \in \Omega$; then the
restriction of f to $M \times \{\alpha\}$,

$$f : M \times \{\alpha\} \to TM,$$

is a vector field on M. Relative to the coordinates (x, V) we
can write

$$f(q, \alpha) = \sum_{i=1}^{n} f_i(q, \alpha) \frac{\partial}{\partial x_i}, \qquad q \in V.$$

Let u(t) be an admissible control defined on an interval I.
Let $x : I \to M$ be a trajectory governed by u(t). Then the
costate variable for the pair (x(t), u(t)) is a map $t \to \lambda(t)$
defined for all $t \in I$ such that $\lambda(t) \in T^*_{x(t)}M$ for all $t \in I$,
and satisfies the costate equation

$$\frac{d\lambda_i}{dt} = -\sum_{j=1}^{n} \lambda_j \frac{\partial f_j(x, u)}{\partial x_i}, \qquad i = 1, 2, \ldots, n, \qquad (2.2)$$

where $\lambda = \sum_{i=1}^{n} \lambda_i \, dx_i$.

The system of Equations (2.2) is linear and homogeneous in $\lambda$
and therefore has a unique solution through any initial
point.
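As an illustration of Equations (2.1) and (2.2), the sketch below integrates state and costate together for one assumed example, the double integrator $f(x, u) = (x_2, u)$ under a constant control. Neither the example system nor the helper names (`costate_rhs`, `rk4_step`, `integrate`) appear in the text; the integrator is a standard fourth order Runge-Kutta step.

```python
def f(x, u):
    # Assumed example system (double integrator): dx1/dt = x2, dx2/dt = u.
    return [x[1], u]


def costate_rhs(x, lam, u):
    # Equation (2.2): dlam_i/dt = -sum_j lam_j * (d f_j / d x_i).
    # Here d f_1 / d x_2 = 1 and every other partial derivative vanishes.
    return [0.0, -lam[0]]


def hamiltonian(x, lam, u):
    # Pairing <lam, f(x, u)> of the costate with the vector field.
    return sum(l * fi for l, fi in zip(lam, f(x, u)))


def rk4_step(x, lam, u, dt):
    """One Runge-Kutta step for the combined state and costate equations."""
    def rhs(z):
        return f(z[:2], u) + costate_rhs(z[:2], z[2:], u)

    z = x + lam
    k1 = rhs(z)
    k2 = rhs([zi + 0.5 * dt * k for zi, k in zip(z, k1)])
    k3 = rhs([zi + 0.5 * dt * k for zi, k in zip(z, k2)])
    k4 = rhs([zi + dt * k for zi, k in zip(z, k3)])
    z = [zi + dt / 6.0 * (p + 2.0 * q + 2.0 * r + s)
         for zi, p, q, r, s in zip(z, k1, k2, k3, k4)]
    return z[:2], z[2:]


def integrate(x0, lam0, u, dt, steps):
    """Integrate from (x0, lam0) under the constant control u."""
    x, lam = list(x0), list(lam0)
    for _ in range(steps):
        x, lam = rk4_step(x, lam, u, dt)
    return x, lam
```

Starting from x = (0, 0), $\lambda$ = (1, 2) with u = 1, the pairing $\langle \lambda, f(x, u) \rangle$ keeps its initial value along the computed path, and doubling the initial costate doubles the costate solution, as the homogeneity of (2.2) requires.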


Definition 1 

Given an admissible control u = u(t) defined on an
interval I, a curve $(x, \lambda) : I \to T^*M$ that satisfies Equations
(2.1) and (2.2) (with the given control) for almost all $t \in I$
will be called an integral path governed by u(t).

It is clear that trajectories and integral paths
governed by admissible controls are continuous, and
differentiable almost everywhere, in the domain of definition.
The trajectories in M are natural projections of integral
paths in $T^*M$. By natural projection we always mean the
projection

$$\pi : T^*M \to M$$

associated with the fibre bundle $T^*M$. Let us represent an
element of $T^*M$ by the symbol $[x, \lambda]$; then $\pi[x, \lambda] = x \in M$ and
$\lambda$ is the corresponding cotangent vector in $T^*_x M$. (Note: A
closed interval in the real line is also represented by
square brackets, e.g. the interval [a,b]. This, however, will
not create any confusion as the context will clarify the
symbol.)

The Hamiltonian for the System (2.1) is a function

$$H : T^*M \times \Omega \to \mathbb{R}$$

defined by the relation

$$H([x, \lambda], u) = \langle \lambda, f(x, u) \rangle. \qquad (2.3)$$

Pontryagin's maximum principle says:
in order that an admissible control u(t) and a trajectory
x(t) governed by it be time optimal, it is necessary that
there exists a non-trivial costate variable $\lambda(t)$ corresponding
to the pair (x(t), u(t)) such that

$$H([x(t), \lambda(t)], u(t)) = \max_{v \in \Omega} H([x(t), \lambda(t)], v) \qquad (2.4)$$

and

$$H([x(t), \lambda(t)], u(t)) \ge 0 \qquad (2.4)'$$

for almost all t in the domain of interest.

Definition 2 

An integral path $t \to [x(t), \lambda(t)] \in T^*M$ governed by
the control u(t) will be called an extremal integral path
if $\lambda(t) \ne 0$ for all $t \in I$, and if for the pair $([x(t), \lambda(t)], u(t))$
Relations (2.4) and (2.4)' are satisfied almost everywhere
on I, where I is the domain of definition of the integral path.

It now follows from the maximum principle that an
optimal trajectory in M is the projection under $\pi$ of some
extremal integral path in $T^*M$. We therefore make the
following definition.

Definition 3

An extremal integral path in $T^*M$ whose projection
under $\pi$ is an optimal trajectory in M will be called an
optimal path.

Clearly the control u(t) which governs an optimal
path is an optimal control.

Now consider an optimal path defined on an interval
I. Let $\tau \in I$ be the instant when the optimal path passes
through the point $[x, \lambda] \in T^*M$. Let $\bar{v} \in \Omega$ be the value
of the governing optimal control at $t = \tau$. Then the
following equality is satisfied:

$$H([x, \lambda], \bar{v}) = \max_{v \in \Omega} H([x, \lambda], v). \qquad (2.5)$$

This relation has nothing to do with time. What we want to
stress is that whenever an optimal path passes through a
point $[x, \lambda] \in T^*M$, the value of the optimal control there
depends only on the point in question. The value of time
when we reach the point in question is immaterial. For we
may start from different points lying on the same optimal
path, corresponding to different initial points on the same
optimal trajectory, and reach $[x, \lambda]$ at different epochs; in
all cases, however, we reach the point $[x, \lambda]$ with the same
value of the optimal control. If therefore we were to take
the optimal path rather than the optimal trajectory as our
basis of investigation, it would be wiser to seek the optimal
control in the form

$$u : T^*M \to \mathbb{R}. \qquad (2.6)$$

We encounter a few difficulties here. It may not be
possible to have an optimal path through every point in $T^*M$.
For example, there may exist points $[x, \lambda] \in T^*M$ such that
$H([x, \lambda], v) < 0$ for all $v \in [a,b]$. Owing to the relation
(2.4)', no optimal paths can pass through such points. Let
$\Sigma \subset T^*M$ be the maximal subset such that it is possible to
have an optimal path through every point in $\Sigma$. The optimal
control should therefore be sought in the form

$$u : \Sigma \to \mathbb{R}. \qquad (2.6)'$$

Finally, let us make ourselves clear with regard to
an important limitation before proceeding further. Our aim,
as we have said before, is to study the discontinuities of
the optimal control, to which end we are interested in
obtaining the optimal control in the form (2.6)'. We plan
to do this with the help of the maximum principle. Now the
maximum principle, being a first order necessary condition,
will not always be able to specify the optimal control
uniquely, even if we were able to ascertain its existence a
priori. At times it may be possible to get over this
difficulty with the help of higher order necessary conditions.
We, however, will not get into these complications as they
have a distinctly different flavour from what we aim at. We
call an optimal control problem "singular with respect to
the maximum principle", or in short "singular", if there
arise situations where it is not possible to obtain definite
information about the optimal control (when it exists), in
terms of state and co-state variables, through the maximum
principle. Henceforth our analysis will be confined to
non-singular problems only.

2-2 Extremal Surfaces 

The Hamiltonian is twice continuously differentiable
with respect to the parameter $u \in \Omega = [a,b]$. Therefore
the partial derivative $\partial H([x, \lambda], u) / \partial u$ is defined for all
$[x, \lambda] \in T^*M$ and for all $u \in (a,b)$. At u = a (or b), the
smoothness of H means that it can be extended to a smooth
function $\tilde{H}$ defined in an open neighbourhood of a (or b) in $\mathbb{R}$;
we define

$$\left. \frac{\partial H([x, \lambda], u)}{\partial u} \right|_{u = a \text{ (or } b)} \quad \text{to be the derivative} \quad \left. \frac{\partial \tilde{H}([x, \lambda], u)}{\partial u} \right|_{u = a \text{ (or } b)}.$$

It can be shown that this definition
yields a unique value of the derivative at u = a (or b). For
the sake of convenience we write

$$\frac{\partial H([x, \lambda], u)}{\partial u} = h([x, \lambda], u), \qquad h : T^*M \times [a,b] \to \mathbb{R}. \qquad (2.7)$$


For a while let us suppose that the global maximum
of $H([x, \lambda], u)$ considered as a function of u in [a,b] is
attained in the interior (a,b). If then we succeed in
obtaining the optimal control in the form $[x, \lambda] \to u([x, \lambda])$,
the equation

$$h([x, \lambda], u([x, \lambda])) = 0$$

should necessarily be satisfied for all $[x, \lambda] \in \Sigma$. One way,
therefore, to find the optimal control in the form (2.6)'
is to seek an explicit solution in the form $u = u([x, \lambda])$ for
the equation

$$h([x, \lambda], u) = 0. \qquad (2.8)$$

Let $[x_0, \lambda_0]$ be a point in the cotangent bundle of M.
Suppose the Hamiltonian $H([x_0, \lambda_0], u)$ considered as a function
of u has only non-degenerate critical points in the interval
[a,b]. Since non-degenerate critical points are isolated,
only a finite number of such points can occur in the compact
interval [a,b]. Let the points $u_{01}, u_{02}, \ldots, u_{0\mu}$ in the
open interval (a,b) be the non-degenerate critical points
of $H([x_0, \lambda_0], u)$. Then at each point $([x_0, \lambda_0], u_{0i})$, $u_{0i} \in (a,b)$,
$i = 1, 2, \ldots, \mu$, we have

(i) $h([x_0, \lambda_0], u_{0i}) = 0$, $i = 1, 2, \ldots, \mu$;

(ii) $\dfrac{\partial h}{\partial u}([x_0, \lambda_0], u_{0i}) \ne 0$, $i = 1, 2, \ldots, \mu$.

Moreover $h([x, \lambda], u)$ possesses continuous partial derivatives
with respect to all its arguments. It then immediately
follows from the implicit function theorem that for each
$u_{0i}$, $i = 1, 2, \ldots, \mu$, there exists an open set $\sigma_i$ in $T^*M$
containing the point $[x_0, \lambda_0]$ such that the equation
$h([x, \lambda], u) = 0$ has a unique continuously differentiable
solution $u = u_i([x, \lambda])$ defined on $\sigma_i$ which satisfies
$u_{0i} = u_i([x_0, \lambda_0])$.

For a possible optimal path through $[x_0, \lambda_0]$ the value
of the optimal control there is given by the value at $[x_0, \lambda_0]$
of one of the functions $\{u_1([x, \lambda]), \ldots, u_\mu([x, \lambda])\}$. The
appropriate function $u_j$ is determined by the relation

$$H([x_0, \lambda_0], u_j([x_0, \lambda_0])) > H([x_0, \lambda_0], u_i([x_0, \lambda_0])), \qquad i = 1, 2, \ldots, \mu, \; i \ne j. \qquad (2.9)$$

Relation (2.9) has to undergo a slight modification
if now we reject the assumption that the global maximum of
$H([x, \lambda], u)$ considered as a function of u in [a,b] is attained
in the interior (a,b). To see this, define two constant
functions $u_{\mu+1}$ and $u_{\mu+2}$ on $T^*M$ by the relations

$$u_{\mu+1}([x, \lambda]) = a, \qquad u_{\mu+2}([x, \lambda]) = b. \qquad (2.10)$$

The value of the optimal control at $[x_0, \lambda_0]$ will then be
given by the function $u_k([x, \lambda])$, for some integer k in
$1 \le k \le \mu + 2$, determined by the relation

$$H([x_0, \lambda_0], u_k([x_0, \lambda_0])) > H([x_0, \lambda_0], u_i([x_0, \lambda_0])), \qquad (2.11)$$

$i = 1, 2, \ldots, \mu + 2$, $i \ne k$.

We say that the function $u_k$ defined on the open set $\sigma_k$
represents the optimal control at $[x_0, \lambda_0]$.

A tacit assumption that $H([x_0, \lambda_0], u)$ considered as
a function of u has a unique global maximum in [a,b] has been
made while writing Relation (2.11). At a later stage we
analyse the case when the above function has multiple global
maxima in [a,b].
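The selection rule (2.11) can be sketched in code for a fixed point of the cotangent bundle: collect the interior critical points of H in u, add the end points a and b, and keep the candidate with the largest Hamiltonian. The example below is an assumption made for illustration, namely $f(x, u) = (x_2, \, u - u^3/3)$ with [a,b] = [-1.5, 1.5], for which the u-dependent part of H is $\lambda_2 (u - u^3/3)$. The root finder (sign changes plus bisection) and all function names are likewise hypothetical.

```python
def critical_points(h, a, b, samples=400, tol=1e-12):
    """Interior zeros of h on (a, b), located by sign changes plus bisection."""
    roots = []
    step = (b - a) / samples
    for n in range(samples):
        lo, hi = a + n * step, a + (n + 1) * step
        f_lo, f_hi = h(lo), h(hi)
        if f_lo == 0.0:
            if n > 0:  # a zero sitting exactly on an interior grid point
                roots.append(lo)
        elif f_lo * f_hi < 0.0:
            while hi - lo > tol:
                mid = 0.5 * (lo + hi)
                f_mid = h(mid)
                if f_lo * f_mid <= 0.0:
                    hi = mid
                else:
                    lo, f_lo = mid, f_mid
            roots.append(0.5 * (lo + hi))
    return roots


def maximizing_control(H, h, a, b):
    """Candidates u_1..u_mu (zeros of h) plus the end points a and b;
    return the candidate with the largest value of H, and that value."""
    candidates = critical_points(h, a, b) + [a, b]
    values = [H(u) for u in candidates]
    k = max(range(len(candidates)), key=lambda i: values[i])
    return candidates[k], values[k]


def example(lam2):
    """u-dependent part of H, and h = dH/du, for the assumed system
    f(x, u) = (x2, u - u**3 / 3); the term lam1 * x2 does not involve u."""
    def H(u):
        return lam2 * (u - u ** 3 / 3.0)

    def h(u):
        return lam2 * (1.0 - u * u)

    return H, h
```

With $\lambda_2 = 2$ the maximizing candidate is the interior critical point near u = 1; with $\lambda_2 = -2$ it is the one near u = -1. In both cases the end points lose the comparison, so the maximum is interior.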

We say that the function $\nu : S \to \mathbb{R}$, $S \subset T^*M$,
represents the optimal control in a subset $\sigma \subset S$ if for
every $[x, \lambda] \in \sigma$

$$a \le \nu([x, \lambda]) \le b$$

and

$$H([x, \lambda], \nu([x, \lambda])) = \max_{v \in [a,b]} H([x, \lambda], v).$$

Relation (2.11), however, implies a stronger result if
we assume that either the points a, b are not critical points
of the function $H([x_0, \lambda_0], u)$ or, if they are critical points,
they are non-degenerate and the corresponding solutions
(through the implicit function theorem) to the equation
$h([x, \lambda], u) = 0$ are given by $u = u_{\mu+1}([x, \lambda])$ and $u = u_{\mu+2}([x, \lambda])$.
We proceed to obtain this result in the Lemma below.

Lemma 1 


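If the function $u_k([x, \lambda])$ represents the optimal control
at $[x_0, \lambda_0] \in T^*M$, then there exists a neighbourhood N of
$[x_0, \lambda_0]$ such that $u_k([x, \lambda])$ represents the optimal control
at every point of N.

Proof. We must show that if $u_k([x, \lambda])$ satisfies
Relation (2.11) then there exists a neighbourhood N of
$[x_0, \lambda_0]$ such that for all $[x, \lambda] \in N$

(i) $a \le u_k([x, \lambda]) \le b$,
                                                                  (A)
(ii) $H([x, \lambda], u_k([x, \lambda])) = \max_{v \in [a,b]} H([x, \lambda], v)$.

Let $\delta_i = H([x_0, \lambda_0], u_k([x_0, \lambda_0])) - H([x_0, \lambda_0], u_i([x_0, \lambda_0]))$, $i \ne k$.
Then $\delta_i > 0$, as is evident from Relation (2.11). Let $\sigma$ be a
neighbourhood of $[x_0, \lambda_0]$ such that it is contained in all $\sigma_i$.
Let $N_i \subset \sigma$, $i = 1, 2, \ldots, \mu+2$, $i \ne k$, be neighbourhoods of
$[x_0, \lambda_0]$ such that for all $[x, \lambda] \in N_i$ we have

(i) $a \le u_k([x, \lambda]) \le b$

and

(ii) $|H([x_0, \lambda_0], u_k([x_0, \lambda_0])) - H([x, \lambda], u_k([x, \lambda]))| < \dfrac{\delta_i}{3}$, $\quad |H([x, \lambda], u_i([x, \lambda])) - H([x_0, \lambda_0], u_i([x_0, \lambda_0]))| < \dfrac{\delta_i}{3}$.

These are possible because the $u_i$ and H are continuous functions
on $\sigma$. For all $[x, \lambda] \in N_i$ we therefore have

$$H([x, \lambda], u_k([x, \lambda])) - H([x, \lambda], u_i([x, \lambda]))$$

$$\ge \delta_i - |H([x_0, \lambda_0], u_k([x_0, \lambda_0])) - H([x, \lambda], u_k([x, \lambda]))| - |H([x, \lambda], u_i([x, \lambda])) - H([x_0, \lambda_0], u_i([x_0, \lambda_0]))|$$

$$> \delta_i - \frac{\delta_i}{3} - \frac{\delta_i}{3} > 0,$$

or $H([x, \lambda], u_k([x, \lambda])) > H([x, \lambda], u_i([x, \lambda]))$ for all $[x, \lambda] \in N_i$.

Let $N = N_1 \cap N_2 \cap \cdots \cap N_{k-1} \cap N_{k+1} \cap \cdots \cap N_{\mu+2}$. Then for
all $[x, \lambda] \in N$ we have

(i) $a \le u_k([x, \lambda]) \le b$,
                                                                  (B)
(ii) $H([x, \lambda], u_k([x, \lambda])) > H([x, \lambda], u_i([x, \lambda]))$,
$i = 1, 2, \ldots, \mu+2$, $i \ne k$.

Now the functions $u_i([x, \lambda])$ obtained from the implicit function
theorem satisfy the equation $h([x, \lambda], u) = 0$ at all points
$[x, \lambda] \in \sigma$. Therefore for every $[x, \lambda] \in N$ the values of u at
which $H([x, \lambda], u)$ considered as a function of u in [a,b] attains
its extreme values are determined by the functions $\{u_i\}$ (the
interior critical points together with the end points a and b). That all the
extreme values are accounted for can be ensured by choosing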
N sufficiently small. Therefore Relations (B) imply Relations
(A).

We now make some geometric constructions. Take an
arbitrary point $[x', \lambda'] \in T^*M$. Let the points $u'_1, u'_2, \ldots, u'_\mu$ in
the closed interval [a,b] be the non-degenerate critical
points of $H([x', \lambda'], u)$ considered as a function of u. Then,
as we have seen before, for each $u'_i$, $i = 1, 2, \ldots, \mu$, there
exists an open set $\sigma^i$ in $T^*M$ containing the point $[x', \lambda']$
such that the equation $h([x, \lambda], u) = 0$ has a unique continuously
differentiable solution $u = u_i([x, \lambda])$ defined on $\sigma^i$ which
satisfies $u'_i = u_i([x', \lambda'])$. We define subsets $e^i_{[x', \lambda']} \subset T^*M \times \mathbb{R}$
in the following way:

$$e^i_{[x', \lambda']} = \{([x, \lambda], u) \in T^*M \times \mathbb{R} : [x, \lambda] \in \sigma^i \text{ and } u = u_i([x, \lambda])\}, \qquad i = 1, 2, \ldots, \mu. \qquad (2.12)$$

Then the $e^i_{[x', \lambda']}$ are 2n-dimensional submanifolds of $T^*M \times \mathbb{R}$. We
call these submanifolds "local extremal surfaces for the
point $[x', \lambda']$". Although for each $u'_i$ it is possible to
choose different $\sigma^i$ and therefore construct different
$e^i_{[x', \lambda']}$, we choose just one $\sigma^i$ and construct just one
$e^i_{[x', \lambda']}$ for each $u'_i$. We extend this process and construct
local extremal surfaces for all points $[x, \lambda] \in T^*M$.
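Let $e^i_{[x_1, \lambda_1]}$ and $e^j_{[x_2, \lambda_2]}$ be two local extremal surfaces for
the points $[x_1, \lambda_1]$ and $[x_2, \lambda_2]$ respectively:

$$e^i_{[x_1, \lambda_1]} = \{([x, \lambda], u) \in T^*M \times \mathbb{R} : [x, \lambda] \in \sigma_{1i}, \; u = u_{1i}([x, \lambda])\},$$

$$e^j_{[x_2, \lambda_2]} = \{([x, \lambda], u) \in T^*M \times \mathbb{R} : [x, \lambda] \in \sigma_{2j}, \; u = u_{2j}([x, \lambda])\}.$$

Here the open sets $\sigma_{1i}$, $\sigma_{2j}$ and the continuously differentiable
functions $u_{1i}$ and $u_{2j}$ have meaning as in (2.12). We say that
$e^i_{[x_1, \lambda_1]}$ overlaps $e^j_{[x_2, \lambda_2]}$ if

(i) $\sigma_{1i} \cap \sigma_{2j} \ne \emptyset$, and

(ii) there exists a continuously differentiable function
$u_{1i2j} : \sigma_{1i} \cup \sigma_{2j} \to \mathbb{R}$ such that

$$u_{1i2j}([x, \lambda]) = u_{1i}([x, \lambda]) \quad \forall \, [x, \lambda] \in \sigma_{1i},$$

$$u_{1i2j}([x, \lambda]) = u_{2j}([x, \lambda]) \quad \forall \, [x, \lambda] \in \sigma_{2j}.$$

Moreover the surface

$$\{([x, \lambda], u) \in T^*M \times \mathbb{R} : [x, \lambda] \in \sigma_{1i} \cup \sigma_{2j} \text{ and } u = u_{1i2j}([x, \lambda])\}$$

is said to be the join of $e^i_{[x_1, \lambda_1]}$ and $e^j_{[x_2, \lambda_2]}$.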

The local extremal surfaces have the following
properties:

(i) Two different local extremal surfaces for the same
point do not intersect. Suppose, on the contrary, that
$e^1_{[x', \lambda']} \cap e^2_{[x', \lambda']} \ne \emptyset$, and let $([x, \lambda], u)$ belong
to this intersection. Then through the point $([x, \lambda], u)$
the equation $h([x, \lambda], u) = 0$ has two solutions, viz.
$u = u_1([x, \lambda])$ and $u = u_2([x, \lambda])$. This violates the
conclusions of the implicit function theorem.

(ii) Let $e^i_{[x_1, \lambda_1]}$ and $e^j_{[x_2, \lambda_2]}$ be two local extremal
surfaces for the points $[x_1, \lambda_1]$ and $[x_2, \lambda_2]$ respectively.
Then $e^i_{[x_1, \lambda_1]}$ and $e^j_{[x_2, \lambda_2]}$ are either disjoint
or overlapping. For if the two surfaces have common
points and fail to overlap, the conclusions of the
implicit function theorem are again violated.

So far we have been carrying out local analysis,
and what is important is that it could have been carried
out locally only. But local extremal surfaces are
submanifolds of the same dimension, 2n, everywhere. There is
therefore no difficulty in joining the overlapping surfaces
together. If now we join all overlapping local extremal
surfaces together, we end up with a number of 2n-dimensional
submanifolds in $T^*M \times \mathbb{R}$. We call these
submanifolds "extremal surfaces". We stipulate each extremal
surface to be maximal with respect to connectedness.

Let $\bar{\pi}$ denote the projection

$$\bar{\pi} : T^*M \times \mathbb{R} \to T^*M$$

defined by

$$\bar{\pi}([x, \lambda], u) = [x, \lambda].$$

The extremal surfaces as constructed above have the following
two properties:

(i) No two extremal surfaces intersect. This follows
directly from the properties of local extremal surfaces.

(ii) Let E be an extremal surface. Let $[x, \lambda] \in T^*M$. Then
there does not exist more than one real u such that
$([x, \lambda], u) \in E$.

Proof. Suppose the aforesaid property is false. Let E bend
over itself once. In other words, we can find points of the type
$[x_0, \lambda_0] \in T^*M$ such that there exist exactly two real numbers, say
$u_1$ and $u_2$, with the property that $([x_0, \lambda_0], u_i) \in E$ for i = 1, 2,
and we cannot find more than two such real numbers for any
point belonging to $T^*M$. It therefore follows that through the
point $[x_0, \lambda_0]$ there exists a curve lying in $T^*M$ such that,
as we proceed along this curve from the point $[x_0, \lambda_0]$,
corresponding to every point on the curve there exist two points
on the extremal surface E until we reach a point $[x_1, \lambda_1]$
corresponding to which there exists just one point, say
$([x_1, \lambda_1], v) \in E$. That such a curve exists can be ensured by the
connectedness of E. Denote by C the set of all points on this
curve in between $[x_0, \lambda_0]$ and $[x_1, \lambda_1]$. Let N be a neighbourhood
of $([x_1, \lambda_1], v)$ in $T^*M \times \mathbb{R}$. We can choose N such that $e = N \cap E$
is a local extremal surface for the point
$[x_1, \lambda_1]$. This is possible because extremal surfaces are
constructed by joining overlapping local extremal surfaces.
Let $N' = \bar{\pi}(N)$ and $N'' = (N' \cap C) - \{[x_1, \lambda_1]\}$. Then for every
point $[x, \lambda] \in N''$ we can find two real numbers $v_1$ and $v_2$, say,
such that $([x, \lambda], v_i)$ belongs to e, i = 1, 2. This, however,
contradicts the fact that e is a local extremal surface. E,
therefore, cannot bend over itself, or for that matter it
cannot bend over itself more than once. Property (ii) is
therefore true.*

Consider an extremal surface E. The set $\bar{\pi}(E) = S$ is
open in $T^*M$. It follows from our previous discussion that
there exists a continuously differentiable function $\nu([x, \lambda])$,
defined on S, satisfying $h([x, \lambda], \nu([x, \lambda])) = 0$ everywhere on S,
so that E can be written in the form

$$E = \{([x, \lambda], u) \in T^*M \times \mathbb{R} : [x, \lambda] \in S \text{ and } u = \nu([x, \lambda])\}. \qquad (2.13)$$

That $\nu([x, \lambda])$ is well defined follows from Property (ii)
of the extremal surfaces, and that it is continuously
differentiable and satisfies $h([x, \lambda], \nu([x, \lambda])) = 0$ on S follows from
the fact that extremal surfaces are locally equivalent to local
extremal surfaces.

Define two more hypersurfaces in $T^*M \times \mathbb{R}$:

$$E^a = \{([x, \lambda], u) \in T^*M \times \mathbb{R} : [x, \lambda] \in T^*M \text{ and } u = \nu_a([x, \lambda]) = a\}, \qquad (2.13a)$$

$$E^b = \{([x, \lambda], u) \in T^*M \times \mathbb{R} : [x, \lambda] \in T^*M \text{ and } u = \nu_b([x, \lambda]) = b\}. \qquad (2.13b)$$

We now extend the definition of extremal surfaces so as to
encompass the surfaces $E^a$ and $E^b$ as well, i.e., from now
onwards by extremal surfaces we mean the surfaces $E^a$ and $E^b$
in addition to all the surfaces meant so far. It may be
noted that the new set of extremal surfaces does not possess
property (i) of the old set.

* See Appendix A.

An extremal surface

$$E = \{([x, \lambda], u) \in T^*M \times \mathbb{R} : [x, \lambda] \in S, \; u = \nu([x, \lambda])\}$$

is said to represent the optimal control at a point $[x, \lambda] \in S$
if there exists a neighbourhood N of $[x, \lambda]$ such that the
function $\nu([x, \lambda])$ represents the optimal control in N (see
page 31). The definition implies that E represents the
optimal control at every point in N.

We recall that $\Sigma$ is the set of all points in $T^*M$
through which there exists an optimal path and assume that

$$\mathrm{Int}(\Sigma) \ne \emptyset \quad \text{and} \quad \Sigma \subset \overline{\mathrm{Int}\,\Sigma}.$$

Let $\Sigma_s$ be an open set in $T^*M$ such that $\Sigma_s \subset \Sigma$ and for
all points $[x, \lambda] \in \Sigma_s$:

($A_1$) $H([x, \lambda], u)$ considered as a function of u has only
non-degenerate critical points in the closed interval
[a,b], and

($A_2$) the points $([x, \lambda], a) \in E^a$ and $([x, \lambda], b) \in E^b$ do not
belong to any other extremal surface.


Choose two points $[x_1, \lambda_1]$ and $[x_2, \lambda_2] \in \Sigma_s$ such
that (i) the Hamiltonian $H([x_i, \lambda_i], u)$ considered as a
function of u in [a,b] has a single global maximum, i = 1, 2,
and (ii) there exists an optimal path $\alpha : [t_1, t_2] \to T^*M$
joining the two points and contained entirely in $\Sigma_s$. Let u(t)
be the governing optimal control.

The way $\Sigma_s$ is constructed enables us to invoke Lemma 1
and assert that around every point $[x, \lambda] \in \Sigma_s$ at which
$H([x, \lambda], u)$ considered as a function of u in [a,b] has a
single global maximum, there exists a neighbourhood N and an
extremal surface E which represents the optimal control in N.
Let $E^1$ and $E^2$ be the extremal surfaces which represent the
optimal control in suitable neighbourhoods of $[x_1, \lambda_1]$ and
$[x_2, \lambda_2]$ respectively. Using the notation followed in (2.13)
we write

$$E^i = \{([x, \lambda], u) \in T^*M \times \mathbb{R} : [x, \lambda] \in S_i \text{ and } u = \nu_i([x, \lambda])\}, \qquad i = 1, 2.$$

We then have as a consequence

$$u(t_1) = \nu_1([x_1, \lambda_1]) \quad \text{and} \quad u(t_2) = \nu_2([x_2, \lambda_2]).$$

If now we assume that no other extremal surface except
for $E^1$ and $E^2$ takes over the role of representing the optimal
control along the optimal path $\alpha$, then there exists a point
$[x', \lambda']$ on $\alpha$ where the optimal control switches from $E^1$ to $E^2$.

By the statement "the optimal control switches from
$E^1$ to $E^2$ at $[x', \lambda']$" we mean that at every point on the
portion of the optimal path between $[x_1, \lambda_1]$ and $[x', \lambda']$ (the
point $[x', \lambda']$ excluded) the optimal control is represented
by the extremal surface $E^1$ or, equivalently, by the continuously
differentiable function $\nu_1([x, \lambda])$, while at points on the
portion between $[x', \lambda']$ and $[x_2, \lambda_2]$ ($[x', \lambda']$ excluded)
it is represented by $E^2$ or, equivalently, by $\nu_2([x, \lambda])$.

As the extremal surfaces do not intersect it follows
that

$$\nu_1([x', \lambda']) \ne \nu_2([x', \lambda']).$$

Therefore at $t' \in (t_1, t_2)$, the time when we reach the point
$[x', \lambda']$ traversing the optimal path $\alpha$, the governing optimal
control u(t) suffers a discontinuity. The value of the optimal
control at $[x', \lambda']$ is fixed by our convention (Section 1-1).
If there are extremal surfaces other than $E^1$ and $E^2$ which take up the role
of representing the optimal control between $[x_1, \lambda_1]$ and
$[x_2, \lambda_2]$, then we simply have a larger number of switchings similar
to the one just discussed.

We now proceed to analyse the reasons for which the
optimal control switches from one extremal surface to another.

For an extremal surface

$$E^i = \{([x, \lambda], u) \in T^*M \times \mathbb{R} : [x, \lambda] \in S_i \text{ and } u = \nu_i([x, \lambda])\},$$

where $S_i$ and $\nu_i$ have their usual meaning, we define a vector
field $X_{\nu_i}$ on $S_i$ by the formula

$$X_{\nu_i}([x, \lambda]) = \sum_{k=1}^{n} f_k(x, \nu_i([x, \lambda])) \frac{\partial}{\partial x_k} + \sum_{k=1}^{n} \Big( -\sum_{j=1}^{n} \lambda_j \frac{\partial f_j(x, \nu_i([x, \lambda]))}{\partial x_k} \Big) \frac{\partial}{\partial \lambda_k}, \qquad (2.14)$$

where $x_1, \ldots, x_n, \lambda_1, \ldots, \lambda_n$ define a coordinate neighbourhood
about the point $[x, \lambda] \in S_i$ in the sense that $x_1, \ldots, x_n$
define a coordinate neighbourhood about $x = \pi[x, \lambda] \in M$ and
$\lambda = \sum_j \lambda_j \, dx_j$ (see page 24).

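Let $[x_0, \lambda_0]$ be a point in $\Sigma_s$. For the sake of argument
let us say that the Hamiltonian $H([x_0, \lambda_0], u)$ considered as a
function of u in [a,b] has a single global maximum. Let
$E^1, E^2, \ldots, E^m$ be the m extremal surfaces (these include $E^a$
and $E^b$ also) such that each of the respective open sets
$S_1, S_2, \ldots, S_m$ contains the point $[x_0, \lambda_0]$ (and the collection
$\{E^i\}$, $i = 1, 2, \ldots, m$, contains all such extremal surfaces). Of
these let $E^k$ represent the optimal control in a neighbourhood
of $[x_0, \lambda_0]$. Then we must have

$$H([x_0, \lambda_0], \nu_k([x_0, \lambda_0])) > H([x_0, \lambda_0], \nu_i([x_0, \lambda_0])), \qquad i = 1, 2, \ldots, m, \; i \ne k.$$

The optimal path starting from $[x_0, \lambda_0]$, and in its immediate
vicinity, is nothing but the integral curve of the vector
field $X_{\nu_k}$ through $[x_0, \lambda_0]$. Now the rate of variation of the
Hamiltonian function $H([x, \lambda], \nu_i([x, \lambda]))$ as we proceed along
the integral curve of $X_{\nu_k}$ is given by the Lie derivative
$L_{X_{\nu_k}} H([x, \lambda], \nu_i([x, \lambda]))$ [3, p. 5-24]. Let $x_1, \ldots, x_n, \lambda_1, \ldots, \lambda_n$
define a coordinate neighbourhood about the point $[x, \lambda]$;
we then have

$$L_{X_{\nu_k}} H([x, \lambda], \nu_k([x, \lambda])) = \sum_{j=1}^{n} \frac{\partial H}{\partial x_j} \frac{dx_j}{dt} + \sum_{j=1}^{n} \frac{\partial H}{\partial \lambda_j} \frac{d\lambda_j}{dt} + \frac{\partial H}{\partial u} \Big( \sum_{j=1}^{n} \frac{\partial \nu_k}{\partial x_j} \frac{dx_j}{dt} + \sum_{j=1}^{n} \frac{\partial \nu_k}{\partial \lambda_j} \frac{d\lambda_j}{dt} \Big),$$

where H and its partial derivatives are evaluated at $([x, \lambda], \nu_k([x, \lambda]))$.
Now

$$\frac{\partial H([x, \lambda], \nu_k([x, \lambda]))}{\partial u} = h([x, \lambda], \nu_k([x, \lambda])) = 0$$

(for the constant functions $\nu_a$ and $\nu_b$ it is the bracket multiplying
$\partial H / \partial u$ that vanishes instead), and along the integral curve of $X_{\nu_k}$

$$\frac{dx_j}{dt} = \frac{\partial H([x, \lambda], \nu_k([x, \lambda]))}{\partial \lambda_j}, \qquad \frac{d\lambda_j}{dt} = -\frac{\partial H([x, \lambda], \nu_k([x, \lambda]))}{\partial x_j}.$$

Hence

$$L_{X_{\nu_k}} H([x, \lambda], \nu_k([x, \lambda])) = \sum_{j=1}^{n} \frac{\partial H}{\partial x_j} \frac{\partial H}{\partial \lambda_j} - \sum_{j=1}^{n} \frac{\partial H}{\partial \lambda_j} \frac{\partial H}{\partial x_j}$$

$$\Longrightarrow \quad L_{X_{\nu_k}} H([x, \lambda], \nu_k([x, \lambda])) = 0.$$

In other words the Hamiltonian $H([x, \lambda], \nu_k([x, \lambda]))$ remains
constant, at the value $l$, say, as we proceed along the
integral curve of $X_{\nu_k}$, while the other Hamiltonian functions
$H([x, \lambda], \nu_i([x, \lambda]))$ vary along it.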

At the point $[x_0, \lambda_0]$, $H([x, \lambda], \nu_k([x, \lambda]))$ is greater
than all $H([x, \lambda], \nu_i([x, \lambda]))$; however, as we proceed along the
optimal path through $[x_0, \lambda_0]$ some of the Hamiltonian functions
$H([x, \lambda], \nu_i([x, \lambda]))$ may start increasing and catch up with the
value $l$. Let $[x', \lambda'] \in \Sigma_s$ be the earliest point on the
optimal path when this happens, where the function
$H([x, \lambda], \nu_j([x, \lambda]))$ (where j is an integer between 1 and m,
$j \ne k$) attains the value $l$. The Hamiltonian $H([x, \lambda], u)$
considered as a function of u in [a,b] therefore has two
global maxima when evaluated with $[x, \lambda] = [x', \lambda']$:

$$l = H([x', \lambda'], \nu_k([x', \lambda'])) = H([x', \lambda'], \nu_j([x', \lambda'])) > H([x', \lambda'], \nu_i([x', \lambda'])), \qquad i \ne j, k.$$

$[x', \lambda']$ is a potential point where the optimal control may
switch from $E^k$ to $E^j$. We discuss this below, with regard to
which we take note of the fact that

$$L_{X_{\nu_k}} H([x, \lambda], \nu_j([x, \lambda])) = -L_{X_{\nu_j}} H([x, \lambda], \nu_k([x, \lambda])).$$

(i) Consider first the case where at the point $[x', \lambda']$
we have

$$L_{X_{\nu_j}} H([x, \lambda], \nu_k([x, \lambda])) < 0. \qquad (2.15)$$

This implies that $L_{X_{\nu_k}} H([x, \lambda], \nu_j([x, \lambda])) > 0$ at $[x, \lambda] = [x', \lambda']$.
If from the point $[x', \lambda']$ we traverse the integral curve of
$X_{\nu_k}$, $H([x, \lambda], \nu_j([x, \lambda]))$ increases while $H([x, \lambda], \nu_k([x, \lambda]))$
remains constant. Hence at all points $[x, \lambda]$ on this integral
curve, in the immediate vicinity of $[x', \lambda']$, we have

$$H([x, \lambda], \nu_j([x, \lambda])) > H([x, \lambda], \nu_k([x, \lambda])) = l > H([x, \lambda], \nu_i([x, \lambda])), \qquad i \ne j, k.$$

On the other hand, if we go along the integral curve of $X_{\nu_j}$
from $[x', \lambda']$ instead, $H([x, \lambda], \nu_k([x, \lambda]))$ decreases while
$H([x, \lambda], \nu_j([x, \lambda]))$ remains constant at the value $l$. At
points $[x, \lambda]$ on this integral curve close to the point
$[x', \lambda']$ we then have

$$l = H([x, \lambda], \nu_j([x, \lambda])) > H([x, \lambda], \nu_i([x, \lambda])), \qquad i \ne j.$$

The maximum principle therefore dictates that the optimal path
from $[x', \lambda']$ onwards and close to it will be the integral
curve of the vector field $X_{\nu_j}$ through $[x', \lambda']$. Thus the
optimal control switches from the extremal surface $E^k$ to the
extremal surface $E^j$ at $[x', \lambda']$.

(ii) If on the other hand at the point $[x', \lambda']$ we have

$$L_{X_{\nu_j}} H([x, \lambda], \nu_k([x, \lambda])) > 0, \qquad (2.16)$$

a similar argument leads us to conclude that the optimal
control does not switch at $[x', \lambda']$. The optimal path from
$[x', \lambda']$ onwards continues to be along the integral curve of
$X_{\nu_k}$.

(iii) If we have instead

$$L_{X_{\nu_k}} H([x', \lambda'], \nu_j([x', \lambda'])) = 0 \quad \text{and} \quad L_{X_{\nu_j}} H([x', \lambda'], \nu_k([x', \lambda'])) = 0, \qquad (2.17)$$

it may happen that $H([x, \lambda], \nu_j([x, \lambda]))$ remains constant along
the integral curve of $X_{\nu_k}$ through $[x', \lambda']$ while
$H([x, \lambda], \nu_k([x, \lambda]))$ remains constant along the integral curve
of $X_{\nu_j}$ through the same point. Then as far as the maximum principle
goes we have equal liberty of choosing either of these integral
curves as our optimal path. We encounter a singular problem
here. Such situations will not arise as by assumption our
system is non-singular. We therefore expect the Hamiltonian
$H([x, \lambda], \nu_k([x, \lambda]))$ ($H([x, \lambda], \nu_j([x, \lambda]))$) to vary along the
integral curve of $X_{\nu_j}$ ($X_{\nu_k}$) through $[x', \lambda']$. The nature of
this variation can be determined by examining the higher order
Lie derivatives of $H([x, \lambda], \nu_k([x, \lambda]))$ ($H([x, \lambda], \nu_j([x, \lambda]))$)
with respect to $X_{\nu_j}$ ($X_{\nu_k}$). An application of reasoning similar
to that developed in (i) and (ii) above can then determine
the optimal path through $[x', \lambda']$.
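Cases (i) to (iii) amount to a sign test on one Lie derivative, which can be sketched numerically. The example reuses an assumed system, $f(x, u) = (x_2, \, u - u^3/3)$, with the two interior branches $\nu_k = 1$ and $\nu_j = -1$; the Lie derivative is approximated by a central difference along the vector field (2.14). The system, the tolerance and the function names are illustrative assumptions, not part of the text.

```python
def branch_field(u_const):
    """Vector field (2.14) for the assumed system f(x, u) = (x2, u - u**3 / 3)
    on a branch where the control is the constant u_const.
    A point of the cotangent bundle is written z = (x1, x2, lam1, lam2)."""
    g = u_const - u_const ** 3 / 3.0

    def field(z):
        x1, x2, lam1, lam2 = z
        # dx/dt = f(x, u); dlam_k/dt = -sum_j lam_j * (d f_j / d x_k)
        return [x2, g, 0.0, -lam1]

    return field


def branch_hamiltonian(u_const):
    """H([x, lam], u) restricted to the branch u = u_const."""
    g = u_const - u_const ** 3 / 3.0

    def H(z):
        x1, x2, lam1, lam2 = z
        return lam1 * x2 + lam2 * g

    return H


def lie_derivative(field, func, z, eps=1e-6):
    """Central-difference approximation of the derivative of func along field."""
    v = field(z)
    z_plus = [zi + eps * vi for zi, vi in zip(z, v)]
    z_minus = [zi - eps * vi for zi, vi in zip(z, v)]
    return (func(z_plus) - func(z_minus)) / (2.0 * eps)


def decide(lie_Xj_Hk, tol=1e-9):
    """Sign test on L_{X_j} H(nu_k) at a point where both branches tie."""
    if lie_Xj_Hk < -tol:
        return "switch"        # case (i), Relation (2.15)
    if lie_Xj_Hk > tol:
        return "stay"          # case (ii), Relation (2.16)
    return "higher order"      # case (iii), Relation (2.17)
```

At z = (0, 0.5, 1, 0), where $\lambda_2 = 0$ and the branch Hamiltonians coincide, the derivative of $H(\nu_k)$ along $X_{\nu_j}$ is negative, so the test reports a switch from u = 1 to u = -1. The derivative of $H(\nu_j)$ along $X_{\nu_k}$ has the opposite sign, in line with the antisymmetry noted before case (i).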

When the Hamiltonian $H([x', \lambda'], u)$ considered as a
function of u in [a,b] has more than two global maxima, the
optimal path from $[x', \lambda']$ onwards can be chosen by considerations
essentially similar to those in (i), (ii) and (iii) above.
For example, if at the point $[x', \lambda']$ we have

$$l = H([x', \lambda'], \nu_k([x', \lambda'])) = H([x', \lambda'], \nu_j([x', \lambda'])) = H([x', \lambda'], \nu_p([x', \lambda'])) > H([x', \lambda'], \nu_i([x', \lambda'])), \qquad i \ne j, k, p,$$

then

$$L_{X_{\nu_p}} H([x', \lambda'], \nu_j([x', \lambda'])) < 0 \quad \text{and} \quad L_{X_{\nu_p}} H([x', \lambda'], \nu_k([x', \lambda'])) < 0$$

imply that the optimal path from $[x', \lambda']$ onwards will be along
the integral curve of $X_{\nu_p}$ through $[x', \lambda']$. The equalities

$$L_{X_{\nu_k}} H([x', \lambda'], \nu_j([x', \lambda'])) = L_{X_{\nu_j}} H([x', \lambda'], \nu_p([x', \lambda'])) = L_{X_{\nu_p}} H([x', \lambda'], \nu_k([x', \lambda'])) = 0,$$

on the other hand, demand the examination of higher order Lie
derivatives to determine the optimal path through $[x', \lambda']$.

An altogether new situation arises if the inequalities

^■k 

Lj, H([x'/\'J f ([x'/X'])) > 0 and 
3 

hy H( [x' , [x' / X' ] ) ) > 0 hold, 

p 

Evidently from the point $[x', \lambda']$ onwards the optimal path
cannot be along the integral curve of any one of the vector
fields $X_{\nu_j}$, $X_{\nu_k}$, $X_{\nu_p}$. However, in a well defined system the
optimal path ought to be able to continue past $[x', \lambda']$. The
way out is for the Hamiltonian $H([x', \lambda'], u)$ considered as
a function of u in [a,b] to have developed more than three
global maxima.

In general, therefore, all points $[x', \lambda'] \in \Sigma_s$ where the
Hamiltonian $H([x', \lambda'], u)$ considered as a function of u in
[a,b] has multiple global maxima are potential switching
points of the optimal control, and the mechanism of switching
is merely a generalized version of the process just discussed.

The analysis so far leads us to several important
facts. First of all we note that the open set $\Sigma_s$ contains two
types of points:

($T_1$) the points $[x, \lambda]$ for which the Hamiltonian $H([x, \lambda], u)$
considered as a function of u in [a,b] has a single
global maximum, and

($T_2$) the points for which it has multiple global maxima.

From Lemma 1 it follows that the set of points of
the type ($T_1$) is an open subset of $\Sigma_s$. The assumption that
our optimal control problem is non-singular with respect to the
maximum principle prohibits the existence of a non-empty
interior for the set of points of the type ($T_2$). For,
suppose there exists a point $[x', \lambda'] \in \Sigma_s$ and a neighbourhood
N around it such that N consists of points of the type ($T_2$)
only; then the optimal path through $[x', \lambda']$ penetrates N.
However, once we are inside N, the maximum principle fails to
specify the control and it will not be possible to construct
the optimal path using it.

At every point of the type ($T_1$) the optimal control is
represented by one of the extremal surfaces, while at points
of the type ($T_2$) the optimal control possibly switches from
one extremal surface to another. Let $A_i$ denote the set of
points in $S$ at which $E^i$ represents the optimal control, and
consider all such $A_i$ (corresponding to all extremal surfaces
$E^i$) which are non-empty. The $A_i$ are evidently open subsets of $S$
and we have

$$A_i \cap A_j = \emptyset \quad \text{for } i \neq j.$$

We decompose each $A_i$ into its components

$$A_i = \bigcup_k a_{i_k},$$

where each $a_{i_k}$ is a maximally connected component of $A_i$. Clearly
the subsets $a_{i_k}$ (for all $i$ and all $k$) are open subsets of $S$
disjoint from each other. In the relative topology of $S$
the boundary of each $a_{i_k}$ consists of points of the type ($T_2$)
where the optimal control switches from $E^i$ to other extremal
surfaces or vice versa. There may also exist points on the
boundary such that for an optimal path through them the optimal
control does not suffer any switchings there (see condition
(2.16)). Moreover it is evident that

$$S = \overline{\bigcup_i A_i} = \overline{\bigcup_i \Big( \bigcup_k a_{i_k} \Big)},$$

where the closure is taken in the relative topology of $S$.

The optimal control at every point in $a_{i_k}$ and at some points
on its boundary is represented by the continuously differentiable
function $v_i([x,\lambda])$. Let $C_{i_k}$ denote the set of all
such points; we have

$$a_{i_k} \subset C_{i_k} \subset \overline{a_{i_k}},$$

and $S$ is the disjoint union

$$S = \bigcup_{i,k} C_{i_k}.$$


This enables us to conclude the following:

There exists a partition of $S$ by partition sets $\{C_i\}$,
say, such that the $C_i$

(i) have a non-empty interior,

(ii) are connected,

(iii) admit a continuously differentiable function
$\mu_i : S_i \to \mathbb{R}$, where $S_i$ is an open subset of $S$ with
$C_i \subset S_i$, such that $\mu_i$ represents the optimal control on $C_i$, and

(iv) are maximal with respect to (ii) and (iii).

(Note: Closures and interiors are with respect to the relative
topology of $S$.)

The partition sets will be called control regions.

In the next section we generalize our analysis to construct
control regions for the whole of $\Sigma$.

Remark 1. When an optimal path crosses over from one control
region to another the optimal control switches from one extremal
surface to another and suffers a discontinuity in the process.
We reserve the term "discontinuous switching" to describe this
phenomenon. This is because at a later stage we encounter a
phenomenon where the optimal control switches from one extremal
surface to another without suffering discontinuities.

2-3 Control Regions 

To effect a partition of $\Sigma$ on the lines suggested in the
last section, we analyse the two assumptions which were used
to define $S$ (see page 40). For, if we succeed in disengaging
ourselves from those two assumptions and are still able to
construct control regions, our purpose will have been served.

The first assumption for $S$ stipulates that for the
points $[x,\lambda] \in S$, $H([x,\lambda],u)$ considered as a function of $u$
in $[a,b]$ has only non-degenerate critical points. If we negate
this and allow degenerate critical points to occur as well,
a host of new situations arise. This is because if $\eta \in [a,b]$
is a degenerate critical point, then through the point
$([x,\lambda],\eta) \in T^*M \times \mathbb{R}$ we can no longer expect a unique local
extremal surface for the point $[x,\lambda]$ to exist. In fact it
is possible to have more than one local extremal surface, or
none at all, through $([x,\lambda],\eta)$. This follows from the theory
of branching of solutions [6].

From now on, we abbreviate the statement "critical
points of $H([x,\lambda],u)$ considered as a function of $u$ in $[a,b]$"
to read "critical points of $H([x,\lambda],u)$ in $[a,b]$". For a
given $[x,\lambda]$, if $H([x,\lambda],u)$ has only isolated critical points
in $[a,b]$, it follows that they are finite in number. On the
other hand, if the critical points in $[a,b]$ are not all
isolated, it is possible to have an infinite number of isolated
critical points in $[a,b]$. This however is an extremely
unusual situation and very difficult to treat. To
obviate such difficulties we make the following
assumption. We assume that $H([x,\lambda],u)$ can
have critical points in $[a,b]$ in either or both of the
following two ways:

(i) it can have finitely many isolated critical points;

(ii) there may exist an interval $[c,d]$, $a < c < d < b$,
such that every point in $[c,d]$ is a critical point;
in such cases $H([x,\lambda],u)$ considered as a function of
$u$ remains constant in $[c,d]$.

We further categorize these critical points into the following
three categories:

(C1) From among the critical points of $H([x,\lambda],u)$ in $[a,b]$
we first choose those points, say $u_1, \ldots, u_p$, lying in $(a,b)$
such that through each $([x,\lambda],u_i) \in T^*M \times \mathbb{R}$, $i = 1, \ldots, p$,
there exists a unique local extremal surface for the point
$[x,\lambda]$. The set $\{u_1, \ldots, u_p\}$ therefore consists of all
non-degenerate critical points together with some well behaved
degenerate critical points, and has a finite number of elements
because it is a subset of the set consisting of all isolated
critical points in $[a,b]$. We now extend the above selection
to all points in $T^*M$, construct local extremal surfaces (when
possible) for each of them and join the overlapping local
extremal surfaces. We end up as before with a number of
$2n$-dimensional submanifolds in $T^*M \times \mathbb{R}$; these are our new
extremal surfaces. As before we stipulate that the extremal
surfaces are maximally connected. We also construct the
surfaces $E^a$ and $E^b$ as defined earlier and call them extremal
surfaces. If $u = a$ ($b$) happens to be a critical point of
$H([x,\lambda],u)$ for some $[x,\lambda] \in T^*M$, it is taken care of by the
extremal surfaces $E^a$ ($E^b$). In the process we also allow $E^a$ ($E^b$)
to have points in common with other extremal surfaces, thereby
negating the second assumption required to select $S$ (see page 40).

(C2) Next consider the degenerate critical points of
$H([x,\lambda],u)$ of the type $\eta \in [a,b]$, such that the point
$([x,\lambda],\eta)$ belongs to the boundary of some extremal surface $E$
as constructed in (C1), and such that it is possible to extend
$E$ smoothly up to the point $([x,\lambda],\eta)$. (Here by the boundary of
$E$ we mean the set $\overline{E} - E$.) We extend $E$ smoothly up to the
point $([x,\lambda],\eta)$ and continue doing this for all $[x,\lambda] \in T^*M$
for which $H([x,\lambda],u)$ has critical points of the type $\eta$.

Now critical points of the type $\eta$ are not of the category
(C1); however, for some of them it may be possible to have
more than one local extremal surface through $([x,\lambda],\eta)$. In
such cases we can extend the extremal surfaces (as constructed
in (C1)) which overlap with the local extremal surfaces
through $([x,\lambda],\eta)$ up to the point $([x,\lambda],\eta)$, by the simple
process of joining them with the local extremal surfaces they
overlap with. We now extend the definition of the extremal
surfaces to encompass the extended extremal surfaces as well.
It is now possible for any two extremal surfaces to have common
points, and these common points (for surfaces not involving $E^a$
and $E^b$) are of the type $([x,\lambda],\eta)$ just discussed.

(C3) In the third category we place all degenerate critical
points of $H([x,\lambda],u)$ which are neither of the type (C1) nor
(C2). Let $\eta' \in [a,b]$ denote such a critical point. If $G$ is
the set of all points in a neighbourhood of $([x,\lambda],\eta')$ at
which $\partial H([x,\lambda],u)/\partial u = 0$, $G$ can have a rather bizarre shape.

We assume that for an optimal path through $[x,\lambda]$, $\eta'$ is
barred from representing the value of the optimal control at
$[x,\lambda]$ by the non-singularity condition. That this is true for
well behaved systems of the form (2.1) can be seen directly
from the following discussion.

First of all we note that an isolated critical point of
$H([x,\lambda],u)$ is either of the type (C1) or (C2) and therefore
not of the type (C3). On the other hand, let all the points
in some interval $[c,d]$, $a < c < d < b$, be critical points
of $H([x,\lambda],u)$; then $H([x,\lambda],u)$ considered as a function of $u$
remains constant in $[c,d]$. If $\eta' \in [c,d]$ is not of the
type (C2) then it is of the type (C3). Now if $H([x,\lambda],u)$
considered as a function of $u$ in $[a,b]$ attains its global
maximum at $u = \eta'$, it does so at all points $u \in [c,d]$; with
regard to this property therefore it is not possible to
distinguish $\eta'$ from other points in $[c,d]$. In other words,
the fact that $\eta'$ represents the value of the optimal control
at $[x,\lambda]$ cannot be concluded from the maximum principle. But
the optimal control is assumed to be non-singular; it is
therefore clear that for a possible optimal path through $[x,\lambda]$
the value of the optimal control there cannot be $\eta'$.

We can therefore forget the critical points of the type
(C3). Critical points of the types (C1) and (C2) only can
represent the value of the optimal control, and they lie in
one or other of the extremal surfaces.

This concludes the discussion of situations that arise
when the two assumptions that were required to select $S$ are
ignored.

The extremal surfaces, as redefined, retain their old
form with a slight change. If $E^i$ is an extremal surface we
can write

$$E^i = \{([x,\lambda],u) \in T^*M \times \mathbb{R} : [x,\lambda] \in S_i \text{ and } u = v_i([x,\lambda])\}.$$

Here $S_i = \pi(E^i)$ need no longer be open always. However, as of
old, $v_i$ is a continuously differentiable function defined on $S_i$
(which is possible by virtue of the construction of $E^i$, see also
page 29) such that $\partial H([x,\lambda],v_i)/\partial u = 0$ for all $[x,\lambda] \in S_i$. Further

$$E^a = \{([x,\lambda],u) \in T^*M \times \mathbb{R} : [x,\lambda] \in T^*M \text{ and } u = v_a([x,\lambda]) = a\},$$

$$E^b = \{([x,\lambda],u) \in T^*M \times \mathbb{R} : [x,\lambda] \in T^*M \text{ and } u = v_b([x,\lambda]) = b\}.$$


We now turn to the problem of partitioning $\Sigma$ into control
regions. Let the optimal control in a neighbourhood of a point
$[x_1,\lambda_1] \in \Sigma$ be represented by the extremal surface $E^1$. The
optimal path through $[x_1,\lambda_1]$, and in its immediate vicinity,
is along the integral curve of the vector field $X_{v_1}$. Traversing
this optimal path, we reach a point $[x^*,\lambda^*]$, say, where the
value of the optimal control is $u^*$, and suppose $([x^*,\lambda^*],u^*)$
is common to the extremal surfaces $E^1, E^2, \ldots, E^m$. The optimal
path from $[x^*,\lambda^*]$ onwards is along the integral curve of
one of the vector fields $X_{v_1}, \ldots, X_{v_m}$, and the appropriate
vector field can be determined by an analysis similar to that
employed in the last section to study the discontinuous
switching of the optimal control. For example, the optimal path
is along the integral curve of $X_{v_2}$ if the inequality

$$X_{v_2} H([x^*,\lambda^*], v_i([x^*,\lambda^*])) < 0 \tag{2.18}$$

holds for all $i \neq 2$ for which the Hamiltonian $H([x,\lambda],v_i([x,\lambda]))$
is defined along the integral curve of $X_{v_2}$
through the point $[x^*,\lambda^*]$. We say that the optimal control
switches continuously from $E^1$ to $E^2$ at $[x^*,\lambda^*]$. This is
because at $[x^*,\lambda^*]$

$$v_1([x^*,\lambda^*]) = v_2([x^*,\lambda^*]).$$

We call switchings of this type "continuous switchings" to
emphasize the fact that they are different in nature from the
discontinuous switchings discussed in Section (2-2). While
discontinuous switchings make the optimal control discontinuous,
continuous switchings make it non-differentiable at the worst.

With regard to condition (2.18) we would like to mention
a special case. Let $H([x^*,\lambda^*],u)$ considered as a function of
$u$ in $[a,b]$ have a unique global maximum at $u = b$ and let
$([x^*,\lambda^*],b)$ be common to both $E^i$ and $E^b$. The optimal path
from $[x^*,\lambda^*]$ will not be along $X_{v_i}$ if $X_{v_i} v_i([x,\lambda]) > 0$
at $[x^*,\lambda^*]$. This is so even if the supporting inequality

$$X_{v_i} H([x,\lambda], v_b([x,\lambda])) < 0$$

holds. For, otherwise, the value of the optimal control will
exceed $b$, which is not admissible. The optimal path will be
along the integral curve of $X_{v_b}$ through the point $[x^*,\lambda^*]$.

The above analysis shows that there exist the following
three types of points in $\Sigma$:

($T_1$) Points of the type $[x,\lambda] \in \Sigma$ at which the optimal
control is represented by one of the extremal surfaces.

($T_2$) Points of the type $[x',\lambda'] \in \Sigma$ such that, as we
traverse the optimal path through $[x',\lambda']$, the
optimal control switches discontinuously from one
extremal surface to another at this point.

($T_3$) Points of the type $[x^*,\lambda^*]$ such that, as
we traverse the optimal path through $[x^*,\lambda^*]$, the
optimal control switches continuously from one extremal
surface to another at this point.

The analysis, from now on, is confined to the points
in the subset $\Sigma \subset T^*M$; the topology therefore is the relative
topology of $\Sigma$.

The set of points of the type ($T_1$) is an open subset of $\Sigma$,
while the set of points of the type ($T_2$) has an empty interior;
this was established in the last section. Furthermore, it is
clear from the definition of the extremal surfaces, as redefined
in this section, that the set of points of the type ($T_3$)
has an empty interior.

Let $A_i$ denote the set of all points in $\Sigma$ at which $E^i$
represents the optimal control, and consider all such $A_i$
(corresponding to all extremal surfaces $E^i$) which are non-empty.
The $A_i$ are evidently open subsets of $\Sigma$ and we have

$$A_i \cap A_j = \emptyset \quad \text{for } i \neq j.$$

We decompose each $A_i$ into its components

$$A_i = \bigcup_k a_{i_k},$$

where each $a_{i_k}$ is a maximally connected component of $A_i$. Clearly
the subsets $a_{i_k}$ are open subsets of $\Sigma$ disjoint from each other.
The boundary of each $a_{i_k}$ consists of points of the types ($T_2$)
and ($T_3$) where the optimal control switches discontinuously
or continuously from $E^i$ to other extremal surfaces or vice versa.
There may also exist points on $\partial a_{i_k}$ such that for an optimal
path through them the optimal control does not suffer any
switchings there (see condition (2.16)). Moreover it is
evident that

$$\Sigma = \overline{\bigcup_i A_i} = \overline{\bigcup_i \Big( \bigcup_k a_{i_k} \Big)}.$$

The optimal control at every point in $a_{i_k}$ and at some points
on its boundary is represented by the continuously differentiable
function $v_i([x,\lambda])$. Let $C_{i_k}$ denote the set of all such points;
then

$$a_{i_k} \subset C_{i_k} \subset \overline{a_{i_k}}$$

and $\Sigma$ is the disjoint union

$$\Sigma = \bigcup_{i,k} C_{i_k}.$$


This enables us to conclude the following:

There exists a partition of $\Sigma$ by the partition sets $\{C_i\}$,
say, such that the $C_i$

(i) have a non-empty interior,

(ii) are connected,

(iii) admit a continuously differentiable function
$\mu_i : S_i \to \mathbb{R}$, with $C_i \subset S_i$, such that $\mu_i$ represents
the optimal control in $C_i$, and

(iv) are maximal with respect to (ii) and (iii).

The partition sets $\{C_i\}$ will be called control regions.
The family of continuously differentiable functions $\{\mu_i\}$
collectively define the optimal control in the form

$$u : \Sigma \to \mathbb{R}, \tag{$*$}$$

$u$ being defined by the straightforward rule

$$u([x,\lambda]) = \mu_i([x,\lambda]) \quad \text{whenever } [x,\lambda] \in C_i.$$

When an optimal path crosses over from one control region to
another, the optimal control, when represented in the form ($*$),
suffers a discontinuous or a continuous switching.

Let the optimal path through the point $[x_0,\lambda_0]$ be
governed by the optimal control which, when expressed as a function
of time, is $u(t)$. We start from $[x_0,\lambda_0]$ at time $t = 0$ and
reach $[x',\lambda']$ (say) at time $t = \tau$, where the optimal control,
when represented in the form ($*$), switches discontinuously
from one extremal surface to another; it follows that at $t = \tau$
$u(t)$ suffers a discontinuity. On the other hand, if the optimal
control, when represented in the form ($*$), switches continuously
at $[x',\lambda']$ and becomes non-differentiable in the process,
$u(t)$ is likely to be non-differentiable at $t = \tau$. Conversely,
it is clear that as we traverse the optimal path a discontinuity
or a non-differentiability of $u(t)$ occurs only when the optimal
control, when represented in the form ($*$), switches from one
extremal surface to another.

We conclude this chapter with some observations regarding
the nature of the control regions. Let $C_i$ and $C_j$ be two
control regions in $\Sigma$. We denote by $C_{ij}$ the set of all points
in $\partial C_i$ where the optimal path crosses over from the control
region $C_i$ into the control region $C_j$. Below, we make some
comments about these sets for well behaved systems.

We consider a simple case to begin with. Let $C_1$ and $C_2$
be two control regions and let the optimal control in $\mathrm{Int}(C_1)$
and $\mathrm{Int}(C_2)$ be represented by the extremal surfaces $E^1$ and $E^2$
respectively. Further, suppose the control region $C_1$ is such
that there exist optimal paths from $C_1$ to $C_2$ or from $C_2$ to $C_1$
or both, and that no other control region except $C_2$ is
similarly related to $C_1$.

At all points $[x,\lambda] \in C_{12}$

$$g([x,\lambda]) = H([x,\lambda], v_1([x,\lambda])) - H([x,\lambda], v_2([x,\lambda])) = 0$$

and

$$X_{v_1} H([x,\lambda], v_2([x,\lambda])) \ge 0. \tag{$\theta_1$}$$

Similarly, at all points $[x,\lambda] \in C_{21}$

$$g([x,\lambda]) = H([x,\lambda], v_1([x,\lambda])) - H([x,\lambda], v_2([x,\lambda])) = 0$$

and

$$X_{v_2} H([x,\lambda], v_1([x,\lambda])) \ge 0. \tag{$\theta_2$}$$

Now $g([x,\lambda])$ is a smooth map and has rank 1 at every point
where either the condition ($\theta_1$) or the condition ($\theta_2$), without
the equality sign, is satisfied. In well behaved cases the
situation therefore can be as follows:

(i) Both $\mathrm{Int}(C_{12})$ and $\mathrm{Int}(C_{21})$ (interior in the relative
topology of $\partial C_1$) are $(2n-1)$-dimensional smooth manifolds,
conditions ($\theta_1$) and ($\theta_2$) without equality sign being
satisfied respectively.

(ii) $\mathrm{Int}(C_{12})$ and $\mathrm{Int}(C_{21})$ have a common boundary $\Lambda$ (in the
relative topology of $\partial C_1$) and $\partial C_1$ is the disjoint
union

$$\partial C_1 = \mathrm{Int}(C_{12}) \cup \mathrm{Int}(C_{21}) \cup \Lambda.$$

(iii) At points $[x,\lambda] \in \Lambda$

$$g([x,\lambda]) = 0, \quad X_{v_1} H([x,\lambda], v_2([x,\lambda])) = 0 \quad \text{and} \quad X_{v_2} H([x,\lambda], v_1([x,\lambda])) = 0.$$

Now whether the optimal path crosses over from $C_1$ to $C_2$ or
from $C_2$ to $C_1$ at a point, say, $[x',\lambda'] \in \Lambda$ can be determined
by examining the higher order Lie derivatives as mentioned
before. In most cases the optimal path turns back into
the control region from which it came, at points (not all) belonging
to $\Lambda$; we will see this later.


In general, given the control region $C_i$, we can find
more than one other control region of the type $C_j$ such that
there exist optimal paths from $C_i$ to $C_j$ or from $C_j$ to $C_i$ or
both. The sets $C_{ij}$, $C_{ji}$ are subsets of $\partial C_i$. Consider all
such subsets which are non-empty. In well behaved systems
$\mathrm{Int}(C_{ij})$, $\mathrm{Int}(C_{ji})$ (interior in the relative topology of $\partial C_i$)
are $(2n-1)$-dimensional smooth manifolds and $C_{ij} \subset \overline{\mathrm{Int}(C_{ij})}$,
similarly $C_{ji} \subset \overline{\mathrm{Int}(C_{ji})}$. $C_{ij}$, $C_{ji}$ are therefore $(2n-1)$-dimensional
manifolds with boundary. These boundaries are not
smooth in general.


Let $\Lambda$ denote the set of points such that
$[x^*,\lambda^*] \in \Lambda \implies [x^*,\lambda^*]$ is a point common to the boundaries,
in the relative topology of $\partial C_i$, of each of the sets
$C_{ij_1}, \ldots, C_{ij_k}, C_{j_{k+1}i}, \ldots, C_{j_{k+m}i}$ and none else,
$(k+m) < 2n$. Then $\Lambda$ in well behaved cases is a smooth manifold
without boundary, $2n-(k+m) \le \dim \Lambda \le 2n-1$, and $\overline{\Lambda} - \Lambda$ consists
of points which are common not only to the boundaries of
$C_{ij_1}, \ldots, C_{ij_k}, C_{j_{k+1}i}, \ldots, C_{j_{k+m}i}$ but also to the
boundaries of some other such sets.

Thus far we have been considering well behaved cases
only. Given two control regions $C_i$ and $C_j$ such that there
exist optimal paths from $C_i$ into $C_j$, the set $C_{ij}$ need not
always be a $(2n-1)$-dimensional manifold. It can be a manifold
(with boundary of course) with lesser dimensionality and in
worst cases can even fail to qualify as a manifold.

Finally, we observe that if there exists an optimal
path through $[x_0,\lambda_0]$ and if $u_0$ is the value of the optimal
control there, then there also exists an optimal path through
$[x_0,\alpha\lambda_0]$, for all real $\alpha > 0$, where the optimal control has
the value $u_0$ again. Therefore for a control region $C_i$

$$[x,\lambda] \in C_i \implies [x,\alpha\lambda] \in C_i \quad \forall\, \alpha > 0.$$

With this we conclude the present analysis.
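
The scaling property just stated rests on the fact that the Hamiltonian $H = \langle \lambda, f(x,u) \rangle$ is linear in $\lambda$, so multiplying $\lambda$ by a positive constant does not move its maximiser in $u$. The sketch below checks this on a toy example; the dynamics `toy_dynamics`, the interval $[-1.5, 1.5]$ and the grid search are hypothetical choices made only for illustration and do not come from the thesis.

```python
def hamiltonian(lam, f, x, u):
    """H = <lam, f(x, u)> for a co-state vector lam."""
    return sum(l * fi for l, fi in zip(lam, f(x, u)))


def best_control(lam, f, x, a, b, samples=401):
    """Grid approximation of the maximiser of u -> H([x, lam], u) on [a, b]."""
    grid = [a + (b - a) * k / (samples - 1) for k in range(samples)]
    return max(grid, key=lambda u: hamiltonian(lam, f, x, u))


def toy_dynamics(x, u):
    """Hypothetical two-state system, non-linear in the scalar control u."""
    return (x[1], -x[0] + u - u ** 3 / 3.0)


x = (0.4, -0.2)
lam = (0.7, 2.0)
u_star = best_control(lam, toy_dynamics, x, -1.5, 1.5)

# Positive rescaling of the co-state leaves the maximiser unchanged ...
for alpha in (0.1, 2.0, 50.0):
    scaled = tuple(alpha * l for l in lam)
    assert best_control(scaled, toy_dynamics, x, -1.5, 1.5) == u_star

# ... whereas a negative factor turns the maximum into a minimum.
flipped = tuple(-l for l in lam)
print(u_star, best_control(flipped, toy_dynamics, x, -1.5, 1.5))
```

The restriction to $\alpha > 0$ matters: for a negative factor the maximiser of $H$ generally changes, as the last line of the sketch shows for this example.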



CHAPTER 3 


TRANSVERSALITY CONDITION 
3-1 Preliminary Remarks. 

With the help of the constructions made in the last
chapter we now proceed to seek conditions which, when imposed
on the system under consideration, put an upper bound on the
number of discontinuities of the optimal control. To
facilitate this investigation we first define a few terms.

Let $A$ be a submanifold of $T^*M$. We denote by $\partial^M A$ the
boundary, in the manifold sense, of the submanifold $A$ and by $\partial A$
the topological boundary of the set $A$. Further, $\mathrm{Int}(A) = A - \partial A$
and $\mathrm{Int}^M(A) = A - \partial^M A$ [4, pp. 57-58].

Let $V$ and $W$ be two boundaryless smooth submanifolds of
a smooth manifold $N$. Then $V$ is said to be transversal to $W$,
denoted by $V \pitchfork W$, either if

$$V \cap W = \emptyset$$

or if $V \cap W \neq \emptyset$ and for every $x \in V \cap W$

$$V_x + W_x = N_x,$$

where $V_x$, $W_x$ and $N_x$ denote the tangent spaces at $x$.
These conditions are symmetric with respect to $V$ and $W$, hence
$V \pitchfork W \iff W \pitchfork V$ [4, p. 29].
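
At a single point of intersection the transversality condition is a statement about linear subspaces, and can be checked by a rank computation. The following is a minimal sketch under that reading; the function names and the use of exact rational arithmetic are illustrative assumptions, and the example subspaces of $\mathbb{R}^3$ are not taken from the thesis.

```python
from fractions import Fraction


def rank(vectors):
    """Rank of a list of vectors, by exact Gauss-Jordan elimination."""
    rows = [[Fraction(v) for v in vec] for vec in vectors]
    if not rows:
        return 0
    r = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [p - factor * q for p, q in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r


def is_transversal_at_point(tangent_v, tangent_w, ambient_dim):
    """True if the two tangent spaces together span the ambient tangent space."""
    return rank(list(tangent_v) + list(tangent_w)) == ambient_dim


def intersection_codim(tangent_v, tangent_w, ambient_dim):
    """Codimension of a transversal intersection: the codimensions add up."""
    if not is_transversal_at_point(tangent_v, tangent_w, ambient_dim):
        raise ValueError("subspaces are not transversal")
    return (ambient_dim - rank(tangent_v)) + (ambient_dim - rank(tangent_w))


plane = [(1, 0, 0), (0, 1, 0)]   # tangent space of the xy-plane in R^3
z_axis = [(0, 0, 1)]
x_axis = [(1, 0, 0)]
y_axis = [(0, 1, 0)]

print(is_transversal_at_point(plane, z_axis, 3))   # plane and z-axis
print(is_transversal_at_point(x_axis, y_axis, 3))  # two lines in R^3
print(intersection_codim(plane, z_axis, 3))
```

The additivity of codimensions used in `intersection_codim` is the property of transversal intersections on which the counting argument of the next section depends.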

3-2 The Transversality Condition . 

We are considering the time optimal control problem as
defined in Chapter 1. $M$, as we have said before, denotes
the state space and is of dimension $n$, and $T^*M$ denotes its
cotangent bundle. We partition $\Sigma \subset T^*M$ into the family $\{C_i\}$
of control regions.

(Note: The control regions are connected subsets of $\Sigma$. There
may however exist control regions of the type $C_q$ such that
$\mathrm{Int}(C_q)$ is not connected. For such control regions we adopt
the following convention. We express $\mathrm{Int}(C_q)$ as the disjoint
union of its components (maximally connected), say, in the
form $\mathrm{Int}(C_q) = D_1 \cup D_2 \cup \ldots \cup D_h$. Next we partition $C_q$ by
expressing it as the disjoint union $C_q = C_{q_1} \cup C_{q_2} \cup \ldots \cup C_{q_h}$,
such that $\mathrm{Int}(C_{q_s}) = D_s$, $1 \le s \le h$. We now treat
each $C_{q_s}$ as a separate control region.) Let $\mu_i : C_i \to [a,b]$
be the continuously differentiable function that represents
the optimal control everywhere on $C_i$. The function $\mu_i$ is
actually defined on a bigger set $S_i$, which is the projection
under $\pi$ of the extremal surface that represents the optimal
control in $\mathrm{Int}(C_i)$. We define a vector field $X_i$ on $S_i$ by the
rule

$$X_i = \sum_{j=1}^{n} f_j(x,\mu_i([x,\lambda]))\,\frac{\partial}{\partial x_j} + \sum_{j=1}^{n} \Big( -\sum_{k=1}^{n} \lambda_k \frac{\partial f_k}{\partial x_j}(x,\mu_i([x,\lambda])) \Big) \frac{\partial}{\partial \lambda_j}. \tag{3.1}$$

Here $(x_1,\ldots,x_n,\lambda_1,\ldots,\lambda_n)$ are the co-ordinates in a co-ordinate
neighbourhood (see page 43) about the point $[x,\lambda]$ in $T^*M$. We denote
by $\Phi_i(t,\cdot)$ the one parameter group of diffeomorphisms induced
by $X_i$.

Let $x_0$ be a point in the state space $M$. From among
all the optimal trajectories that pass through $x_0$ we propose
to choose one arbitrarily and count the number of discontinuities
of the optimal control along this trajectory. Equivalently,
we can choose a point $[x_0,\lambda_0] \in M^*_{x_0} \cap \Sigma$ arbitrarily
($[x_0,\lambda_0] \neq [x_0,0]$) and study the number of switchings that
the optimal control suffers as we traverse the optimal path
through $[x_0,\lambda_0]$. In order to put a bound on the number of
discontinuities of the optimal control it is sufficient to
consider the discontinuous switchings only; however, it turns
out that the continuous switchings also have a role to play.

Let $C_{ij} \subset \partial C_i$ denote the subset of all points where the
optimal path crosses over from the control region $C_i$ to the
control region $C_j$. We assume that $\mathrm{Int}(C_{ij})$, interior in the
relative topology of $\partial C_i$, is a $(2n-1)$-dimensional smooth
manifold and $C_{ij} \subset \overline{\mathrm{Int}(C_{ij})}$; this implies that $C_{ij}$ is a
$(2n-1)$-dimensional manifold with boundary (we do not exclude
the possibility of this boundary being the null set).
Situations where this assumption does not hold will be
considered later. We propose to carry on the analysis in the
following three steps:

(1) First we suppose that every time the optimal path from
$[x_0,\lambda_0]$ crosses over from a control region, say $C_i$,
to a control region, say $C_j$, it does so at a point
belonging to $\mathrm{Int}^M(C_{ij})$ and the optimal control suffers
a discontinuous switching there.

(2) In the second step we allow the optimal control to
switch in any manner; the optimal path however still
continues to pass through $\mathrm{Int}^M(C_{ij})$.

(3) In the third step we take off all restrictions;
the optimal path is allowed to pass through
$C_{ij} - \mathrm{Int}^M(C_{ij})$ as well and the optimal control switches
in all possible manners.

Step 1

Let $[x_0,\lambda_0]$ be an interior point of the control
region $C_1$, with $x_0 = \pi([x_0,\lambda_0])$. It is then possible to
find a neighbourhood $\sigma_0$ of $[x_0,\lambda_0]$, in the relative topology
of $M^*_{x_0}$, such that $\sigma_0 \subset C_1$. We choose $\sigma_0$ sufficiently
small, so that its size does not hinder the arguments that
follow. $\sigma_0$ is evidently a submanifold of $T^*M$ having
co-dimension $n$ (all co-dimensions in this discussion are with
respect to $T^*M$). The optimal path from a point in $\sigma_0$ is
along the integral curve of the vector field $X_1$ through that
point. Since $\Phi_1(t,\cdot)$ is a diffeomorphism, it follows that
$\Phi_1(t,\sigma_0)$ is a submanifold with co-dimension $n$ for all $t$.

Let the optimal path from $[x_0,\lambda_0]$ meet the boundary
$\partial C_1$ in time $t_1 > 0$. If no such $t_1$ exists it follows that
the optimal control, as we traverse the optimal path from
$[x_0,\lambda_0]$ onwards, suffers no switchings. This immediately
implies that along the optimal trajectory from $x_0 \in M$, which
arises as a projection of this optimal path, the optimal
control suffers no discontinuity. This is a trivial case of
our analysis. We therefore assume the existence of a finite
$t_1 > 0$. Let

$$[x_1,\lambda_1] = \Phi_1(t_1,[x_0,\lambda_0]) \in \partial C_1.$$

The optimal path crosses over from the control region $C_1$ into
the control region $C_2$ (say) at $[x_1,\lambda_1]$. Our assumptions
imply that the optimal control switches discontinuously here
and $[x_1,\lambda_1] \in \mathrm{Int}^M(C_{12})$. Further, it is trivial to note that
$[x_1,\lambda_1] \neq [x_1,0]$. We now impose the crucial condition:

The System (1.1) is such that

$$\Phi_1(t,\sigma_0) \pitchfork \mathrm{Int}^M(C_{12}) \quad \forall\, t > 0. \tag{3.2}$$

Consider the intersection

$$\sigma_1 = \Phi_1(t_1,\sigma_0) \cap \mathrm{Int}^M(C_{12}).$$

$\sigma_1$ is non-empty as $[x_1,\lambda_1] \in \sigma_1$; it follows from (3.2)
that it is a submanifold of $T^*M$ with

$$\operatorname{codim} \sigma_1 = \operatorname{codim} \Phi_1(t_1,\sigma_0) + \operatorname{codim} \mathrm{Int}^M(C_{12}) = n + 1.$$

From the nature of the vector fields $X_i$ it follows
that if $[x',\lambda'] = \Phi_i(t,[x,\lambda])$ then $[x',\alpha\lambda'] = \Phi_i(t,[x,\alpha\lambda])$
for all real $\alpha > 0$ (for, the co-state equations are linear in
the co-state variable and $\mu_i([x,\lambda]) = \mu_i([x,\alpha\lambda])$). Moreover,
from the nature of the Hamiltonian it follows that if the
optimal path through $[x,\lambda]$ crosses over from the control
region $C_i$ to the control region $C_j$ at $[x',\lambda']$ then the optimal
path through $[x,\alpha\lambda]$ does the same at $[x',\alpha\lambda']$ for all real
$\alpha > 0$.

Now as $\sigma_0 \subset M^*_{x_0}$ is open in the relative topology of
$M^*_{x_0}$, it follows that for all $[x,\lambda] \in \sigma_0$ the points $[x,\delta\lambda]$
also belong to $\sigma_0$ for all $\delta$ in the interval $[1-\varepsilon,1+\varepsilon]$
for some sufficiently small positive $\varepsilon$. We therefore conclude
that $[x',\lambda'] \in \sigma_1 \implies [x',\delta\lambda'] \in \sigma_1$ for all $\delta \in [1-\varepsilon',1+\varepsilon']$,
where $\varepsilon'$ is some sufficiently small positive real number.

We started from $\sigma_0$ with co-dimension $n$ and reached $\sigma_1$
having co-dimension $n+1$. The natural projection of $\sigma_0$ into
$M$ was a single point, $\pi(\sigma_0) = x_0$. In other words, at $\sigma_0$ the
dimensional freedom along $M$ was zero while there was
$n$-dimensional freedom in the cotangent space. At $\sigma_1$ the
dimensional freedom along $M$ would have increased in general,
i.e., $\pi(\sigma_1)$ could possibly have gathered a non-zero
dimensional extension. This increase would be at the cost
of dimensional freedom in the cotangent space. However, as is
evident from the discussion in the preceding paragraph, there
always exists at least a one dimensional freedom in the
cotangent space.

The optimal path from every point in $\sigma_1$ is along the
integral curve of the vector field $X_2$. Let the optimal path
from the point $[x_1,\lambda_1]$ meet the boundary $\partial C_2$ again in
time $t_2 > 0$. If no finite $t_2$ exists then we conclude as
before that the optimal control, as we traverse the optimal
path from $[x_0,\lambda_0]$, suffers just one discontinuous switching,
the switching being at the point $[x_1,\lambda_1]$. This again is
trivial; we therefore assume the existence of a finite $t_2 > 0$.
Let

$$[x_2,\lambda_2] = \Phi_2(t_2,[x_1,\lambda_1]) \in \partial C_2.$$

The optimal path crosses over from the control region $C_2$ into
the control region $C_3$ (say) at $[x_2,\lambda_2]$. Our assumptions
imply that the optimal control switches discontinuously here
and $[x_2,\lambda_2] \in \mathrm{Int}^M(C_{23})$. We again stipulate that

$$\Phi_2(t,\sigma_1) \pitchfork \mathrm{Int}^M(C_{23}) \quad \forall\, t > 0. \tag{3.3}$$

Consider the intersection

$$\sigma_2 = \Phi_2(t_2,\sigma_1) \cap \mathrm{Int}^M(C_{23}).$$

$\sigma_2$ is non-empty as $[x_2,\lambda_2] \in \sigma_2$; it follows from (3.3)
that it is a submanifold of $T^*M$ with

$$\operatorname{codim} \sigma_2 = n + 2.$$

We will continue to have similar situations every time
the optimal path crosses over from one control region to
another if at every stage we stipulate the diffeomorphic
image of $\sigma_{m-1}$ to be transversal to the corresponding open
subset of the boundary. For, suppose the above situation
holds at the $(m-1)$th stage ($m \le n-1$), i.e.,

traversing the optimal path we have reached the point
$[x_{m-1},\lambda_{m-1}]$ in the above manner,

$$t_1 > 0,\ t_2 > 0,\ \ldots,\ t_{m-1} > 0;$$

$\sigma_{m-1} = \Phi_{m-1}(t_{m-1},\sigma_{m-2}) \cap \mathrm{Int}^M(C_{m-1,m})$ is a submanifold of $T^*M$ with

$$\operatorname{codim} \sigma_{m-1} = n + (m-1);$$

and at $[x_{m-1},\lambda_{m-1}]$, where the optimal path crosses over from
$C_{m-1}$ to $C_m$, the optimal control suffers the $(m-1)$th
discontinuous switching.

We now show that a similar situation holds in the $m$th stage.

The optimal path from every point in $\sigma_{m-1}$ is along the
integral curve of the vector field $X_m$. Let the optimal path
from the point $[x_{m-1},\lambda_{m-1}]$ meet the boundary $\partial C_m$ again in
time $t_m > 0$. If no finite $t_m$ exists, we conclude as before
that the optimal control, as we traverse the optimal path from
$[x_0,\lambda_0]$, suffers $(m-1)$ switchings only. This again is trivial;
we therefore assume the existence of a finite $t_m > 0$. Let

$$[x_m,\lambda_m] = \Phi_m(t_m,[x_{m-1},\lambda_{m-1}]) \in \partial C_m.$$

The optimal path crosses over from the control region $C_m$ into
the control region $C_{m+1}$ (say) at $[x_m,\lambda_m]$. Our assumptions
imply that the optimal control switches discontinuously here
and $[x_m,\lambda_m] \in \mathrm{Int}^M(C_{m,m+1})$. We stipulate:

The System (1.1) is such that

$$\Phi_m(t,\sigma_{m-1}) \pitchfork \mathrm{Int}^M(C_{m,m+1}) \quad \forall\, t > 0. \tag{3.4}$$

If now we write

$$\sigma_m = \Phi_m(t_m,\sigma_{m-1}) \cap \mathrm{Int}^M(C_{m,m+1}),$$

then $\sigma_m$ is non-empty as $[x_m,\lambda_m] \in \sigma_m$, and from (3.4) it follows
that it is a submanifold of $T^*M$ with

$$\operatorname{codim} \sigma_m = n + m.$$

This process, however, cannot continue indefinitely.
Consider the $(n-1)$th stage; we have reached the point
$[x_{n-1},\lambda_{n-1}]$ and $\sigma_{n-1}$ ($\subset \mathrm{Int}^M(C_{n-1,n})$) has codimension $2n-1$.
The optimal path from $[x_{n-1},\lambda_{n-1}]$ is along the integral
curve of the vector field $X_n$ and under our stipulation either
of the following, and none else, is true:

(i) the optimal path from $[x_{n-1},\lambda_{n-1}]$ does not meet $\partial C_n$
again;

(ii) the optimal path from $[x_{n-1},\lambda_{n-1}]$ meets $\partial C_n$
at $[x_n,\lambda_n] \in \mathrm{Int}^M(C_{n,n+1})$ and

$$\Phi_n(t,\sigma_{n-1}) \pitchfork \mathrm{Int}^M(C_{n,n+1}) \quad \forall\, t > 0.$$

Now suppose (ii) is true; then $\sigma_n$ has codimension $2n$, that
is, zero dimension. This, however, is not possible. For, as we
have seen before, $[x,\lambda] \in \sigma_m \implies [x,\delta\lambda] \in \sigma_m$ for all
$\delta \in [1-\varepsilon,1+\varepsilon]$ and for
all possible $m$, where $\varepsilon$ is a sufficiently small positive
number. Hence (i) is true; in other words the optimal path
from the point $[x_{n-1},\lambda_{n-1}]$ remains in the interior
of the control region $C_n$. We therefore conclude that the
optimal control, as we traverse the optimal path from
$[x_0,\lambda_0]$, does not suffer more than $(n-1)$ discontinuous
switchings.
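
The counting argument of Step 1 can be summarised in a few lines. The sketch below is only bookkeeping under the assumptions of this step: each transversal crossing raises the codimension by one, starting from $n$, and every set $\sigma_m$ must keep at least the one-dimensional freedom along the co-state ray. The function names are hypothetical.

```python
def codim_after_switchings(n, m):
    """Codimension of sigma_m in T*M (dimension 2n) after m transversal crossings."""
    return n + m


def max_discontinuous_switchings(n):
    """Largest m for which sigma_m can still contain a piece of a co-state ray."""
    m = 0
    # A further crossing is admissible only if the next set keeps dimension >= 1.
    while 2 * n - codim_after_switchings(n, m + 1) >= 1:
        m += 1
    return m


for n in range(1, 6):
    print(n, max_discontinuous_switchings(n))
```

For every state dimension $n$ tried here the loop stops at $m = n-1$, in agreement with the bound obtained above.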


At the outset of this analysis we assumed that $[x_0,\lambda_0]$
belongs to the interior of the control region $C_1$. Now
suppose $[x_0,\lambda_0]$ lies on the boundary of some control region.
We then traverse the optimal path from $[x_0,\lambda_0]$ for a negative
time and reach a point $[x_0',\lambda_0']$, say, belonging to the
interior of some control region. We denote this control
region by $C_1$ and take $[x_0',\lambda_0']$ as our starting point. The
optimal control, as we traverse the optimal path from $[x_0',\lambda_0']$,
does not suffer more than $(n-1)$ discontinuous switchings $\implies$
it does not suffer more than $(n-1)$ discontinuous switchings
as we traverse the optimal path from $[x_0,\lambda_0]$.


The condition that

$$\Phi_m(t,\sigma_{m-1}) \pitchfork \mathrm{Int}^M(C_{m,m+1}) \quad \forall\, t > 0$$

for every possible $m$ will be called the transversality
condition I. We have therefore proved the following:

Consider an optimal path of the System (1.1). Suppose
every time the optimal path crosses over from a control region
$C_i$ to a control region $C_j$ it does so at a point belonging to
$\mathrm{Int}^M(C_{ij})$ and that the optimal control suffers a discontinuous
switching there. If the System (1.1) satisfies the
transversality condition I then the optimal control, as we
traverse this optimal path, does not suffer more than $(n-1)$
discontinuous switchings.

$\implies$ The optimal control, considered as a function of time
governing the optimal trajectory (in $M$) which arises as a
projection of the above optimal path, does not suffer more than
$(n-1)$ discontinuities.


Step 2


Consider the mth stage in the above analysis. The
optimal path crosses over from the control region $C_m$ to the
control region $C_{m+1}$ at the point $[x_m,\lambda_m]$. Instead of supposing (as in Step 1)
that the optimal control suffers a discontinuous switching,
let us suppose that it switches continuously as the optimal
path crosses over from $C_m$ to $C_{m+1}$. The optimal path, as long as
it lies in the control region $C_m$ ($C_{m+1}$), is along some integral
curve of the vector field $X_m$ ($X_{m+1}$). Suppose it is possible to
find a $C^1$ vector field $X_{m\,m+1}$ defined everywhere on
$\mathrm{Int}(C_m) \cup \mathrm{Int}(C_{m+1}) \cup \mathrm{Int}(C_{m\,m+1})$ such that

$$X_{m\,m+1}([x,\lambda]) = \begin{cases} X_m([x,\lambda]) & \text{when } [x,\lambda] \in \mathrm{Int}(C_m) \\ X_{m+1}([x,\lambda]) & \text{when } [x,\lambda] \in \mathrm{Int}(C_{m+1}). \end{cases}$$

Clearly, $X_{m\,m+1}$ can be extended smoothly up to $C_{m\,m+1}$.
The optimal path from $[x_{m-1},\lambda_{m-1}]$ onwards is now along an
integral curve of the vector field $X_{m\,m+1}$ and it (the optimal
path) continues to be so even after it has crossed over to
the control region $C_{m+1}$. Now $[x_m,\lambda_m]$ is the point where
the optimal path crosses over from $C_m$ to $C_{m+1}$ and by our
assumption (for Step 2) $[x_m,\lambda_m] \in \mathrm{Int}(C_{m\,m+1})$.
It is therefore possible to carry $\sigma_{m-1}$ over diffeomorphically
past $C_{m\,m+1}$ into the interior of the control region $C_{m+1}$,
where the diffeomorphism in question is generated by the
same vector field, viz. $X_{m\,m+1}$, which generates the
optimal paths in the two control regions $C_m$ and $C_{m+1}$. On
the other hand, if it is not possible to find the $C^1$ vector
field $X_{m\,m+1}$, it is also not possible to transport $\sigma_{m-1}$
diffeomorphically into the control region $C_{m+1}$ in the manner
suggested above and we encounter a situation similar to that
in Step 1 where the switchings were all discontinuous. Let
$\Sigma_m$ and $\Sigma_{m+1}$ be the extremal surfaces that represent the optimal
control in $\mathrm{Int}(C_m)$ and $\mathrm{Int}(C_{m+1})$.

We say that the optimal control switches non-smoothly
from $\Sigma_m$ to $\Sigma_{m+1}$ at $[x_m,\lambda_m]$ if it is not possible to find
the vector field $X_{m\,m+1}$.

Note that non-smooth switchings not only include the
type of continuous switchings just discussed but also include
all discontinuous switchings. Therefore, to continue the
process of reasoning as developed in Step 1 it becomes
logically necessary to make the following stipulation:


The System (1.1) is such that

$$\Phi_m(t,\sigma_{m-1}) \pitchfork \mathrm{Int}(C_{m\,m+1}) \quad \forall\, t > 0$$

whenever the optimal control switches non-smoothly at
$[x_m,\lambda_m]$, and we write

$$\sigma_m = \Phi_m(t_m,\sigma_{m-1}) \cap \mathrm{Int}(C_{m\,m+1}).$$

On the other hand if the optimal control switches smoothly
at $[x_m,\lambda_m]$, i.e., if it is possible to find the vector
field $X_{m\,m+1}$, we write

$$\sigma_m = \Phi_m(t_m,\sigma_{m-1}).$$

$\sigma_m$ therefore has the dimensions of $\sigma_{m-1}$ when the switching
is smooth.


The condition that

$$\Phi_m(t,\sigma_{m-1}) \pitchfork \mathrm{Int}(C_{m\,m+1}) \quad \forall\, t > 0$$

for every possible m for which the optimal control switches
non-smoothly at $[x_m,\lambda_m]$ will be called the transversality
condition II.


It is now evident that as far as transversality
condition II goes there is no point in maintaining different
identities for the two control regions $C_m$ and $C_{m+1}$ if the
optimal control switches smoothly at every point in
$\mathrm{Int}(C_{m\,m+1})$; this is true for every pair of control regions
similarly related. We also recall that $\mathrm{Int}(C_{m\,m+1})$ has been
assumed to be a (2n-1)-dimensional submanifold and this
assumption has been implicitly incorporated into the construction
of the vector field $X_{m\,m+1}$ (when it is possible to construct
$X_{m\,m+1}$). If $\mathrm{Int}(C_{m\,m+1})$ is found to have lesser dimensionality
then by our convention the optimal control switches
non-smoothly at every point in $C_{m\,m+1}$. Further we assume that given
two control regions $C_i$ and $C_j$ the submanifolds $\mathrm{Int}(C_{ij})$ and
$\mathrm{Int}(C_{ji})$ have the same dimensionality. These facts taken
together make it necessary, in the sense that it would be
unnatural otherwise, to combine (taking union) all adjacent
control regions, between which it is possible to switch
smoothly, into a single control region. When this is done
we obtain a new partition of Z. The partition sets of this
partition will again be called control regions and we will
use the symbols $C_i$ to denote them. This will create no
confusion as, from now on, we will be dealing with this type of
control regions mostly and we reserve the term "old control
regions" to be used when an occasion arises to refer back to
the control regions as defined earlier. It is now evident
that whenever an optimal path crosses over from one control
region into another the optimal control suffers a non-smooth
switching. We further assume that these control regions
and the sets $C_{ij}$ possess all the properties of the old control
regions and the old $C_{ij}$.

We repeat all that has been done until now. In other
words we traverse the optimal path from the point $[x_0,\lambda_0]$
assuming that every time the optimal path crosses over from
a control region $C_i$ to a control region $C_j$ it does so at a
point belonging to $\mathrm{Int}(C_{ij})$, and assign the same meaning to
the symbols ($\sigma_m$ and others) as were
assigned in Step 1. We however rewrite the transversality
condition II as follows:

$$\Phi_m(t,\sigma_{m-1}) \pitchfork \mathrm{Int}(C_{m\,m+1}) \quad \forall\, t > 0$$

for every possible m.

All switchings being non-smooth switchings now, the above
condition is equivalent to the earlier transversality condition
II. An analysis similar to the one developed in Step 1 then
proves the following:

Consider an optimal path of the System (1.1). Suppose
every time the optimal path crosses over from a control
region $C_i$ to a control region $C_j$ it does so at a point
belonging to $\mathrm{Int}(C_{ij})$. If the System (1.1) satisfies the
transversality condition II then the optimal control, as we
traverse this optimal path, does not suffer more than (n-1)
non-smooth switchings.

==> The optimal control, considered as a function of time,
governing the optimal trajectory (in M) which arises as the
projection of the above optimal path does not suffer more
than (n-1) non-smoothnesses. Here by a non-smoothness of u(t)
we mean either a discontinuity or a non-differentiability.

In the analysis above, the sets $C_{ij}$ were all assumed
to be (2n-1)-dimensional submanifolds with boundary. The
analysis suffers very little change if we deviate from this
assumption and take $C_{ij}$ to be submanifolds with lesser
dimensionality. Consider the mth stage again. Let $C_{m\,m+1}$
be a (2n-k)-dimensional submanifold with boundary, 1 < k < 2n,
and let $[x_m,\lambda_m] \in \mathrm{Int}(C_{m\,m+1})$; this is in keeping with the analysis
in Step 1 and Step 2. We extend the transversality condition
II to this case in the most natural way by stipulating that

$$\Phi_m(t,\sigma_{m-1}) \pitchfork \mathrm{Int}(C_{m\,m+1}) \quad \forall\, t > 0$$

and write

$$\sigma_m = \Phi_m(t_m,\sigma_{m-1}) \cap \mathrm{Int}(C_{m\,m+1}).$$

$\sigma_m$, clearly, assumes a codimension greater than n+m. On the
whole therefore the number of non-smooth switchings of the
optimal control gets reduced if the dimensionality of $C_{m\,m+1}$,
at any stage m, gets reduced. Finally, if $C_{m\,m+1}$ fails to
qualify as a manifold the above analysis does not apply.

Step 3

Here we take off all restrictions that were imposed
on the system. Consider the mth stage again. The optimal
path crosses over from the control region $C_m$ into the control
region $C_{m+1}$ at $[x_m,\lambda_m]$. We now allow the point $[x_m,\lambda_m]$ to
lie in $(C_{m\,m+1} - \mathrm{Int}(C_{m\,m+1}))$; in other words the point $[x_m,\lambda_m]$
lies on the boundary of $C_{m\,m+1}$ (by "boundary of $C_{ij}$"
we mean the set $(C_{ij} - \mathrm{Int}(C_{ij}))$ for the rest of the chapter).
Hence there may exist other such sets such that $[x_m,\lambda_m]$ also
belongs to the boundaries of these sets. Let us for a while
withdraw the symbol $C_{m+1}$ from the analysis and consider the
following situation.

Let $\Lambda \subset \partial C_m$ denote the set of all points of the
type $[x,\lambda]$ such that $[x,\lambda]$ belongs to the boundary of
each of the sets $C_{m i_1}, C_{m i_2}, \ldots, C_{m i_k}$, $C_{j_1 m}, C_{j_2 m}, \ldots, C_{j_p m}$ and
none else, and let the optimal path from the point $[x_{m-1},\lambda_{m-1}]$
meet $\partial C_m$ again at the point $[x_m,\lambda_m] \in \Lambda$. Since $\Lambda$ is
common to the boundaries of the above (k+p) sets we expect to
find points in $\Lambda$ where the optimal path crosses over from
$C_m$ to $C_{i_r}$, r = 1,2,...,k; similarly there can also exist
points in $\Lambda$ at which the optimal path crosses over into $C_m$
from $C_{j_r}$, r = 1,2,...,p. Let us denote by $\Lambda C_{ij}$ the set of
all points in $\Lambda$ where the optimal path crosses over from
$C_i$ into $C_j$. In keeping with our original assumption
regarding the sets $C_{ij}$ (see page 68) we assume that $\Lambda C_{ij}$
is a smooth submanifold with boundary. $\Lambda$ can now be expressed
as the disjoint union

$$\Lambda = \Lambda C_{m i_1} \cup \Lambda C_{m i_2} \cup \cdots \cup \Lambda C_{m i_k} \cup \Lambda C_{j_1 m} \cup \Lambda C_{j_2 m} \cup \cdots \cup \Lambda C_{j_p m}.$$


We carry on the analysis in the following two steps.

(1) Consider the case where $[x_m,\lambda_m] \in \mathrm{Int}(\Lambda C_{m i_{q_1}})$, where $i_{q_1}$
is an element of the collection $\{i_1, i_2, \ldots, i_k\}$. We take
the points in $\Lambda C_{m i_{q_1}}$ to be of a different nature from the
points in $\mathrm{Int}(C_{m i_{q_1}})$. This in turn prompts us to consider
$\Lambda C_{m i_{q_1}}$ to be a separate entity in itself. To make the
analysis compatible with the process of reasoning developed
in Step 1 and Step 2 it then becomes necessary to stipulate
that

$$\Phi_m(t,\sigma_{m-1}) \pitchfork \mathrm{Int}(\Lambda C_{m i_{q_1}}) \quad \forall\, t > 0.$$

This condition will be called the boundary transversality
condition. We write

$$\sigma_m = \Phi_m(t_m,\sigma_{m-1}) \cap \mathrm{Int}(\Lambda C_{m i_{q_1}})$$

and to avoid unnecessary confusion denote the control region
$C_{i_{q_1}}$ by the symbol $C_{m+1}$ instead. Clearly

$$\operatorname{codim} \sigma_m > (n+m)$$

for $\operatorname{codim} \mathrm{Int}(\Lambda C_{m i_{q_1}}) > 1$.

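(2) Next consider the case where
$[x_m,\lambda_m] \in (\Lambda C_{m i_{q_1}} - \mathrm{Int}(\Lambda C_{m i_{q_1}}))$. In other words the point $[x_m,\lambda_m]$ lies in
the boundary of the set $\Lambda C_{m i_{q_1}}$ (i.e. the set
$(\Lambda C_{m i_{q_1}} - \mathrm{Int}(\Lambda C_{m i_{q_1}}))$). Hence there can exist other such sets,
say, $\Lambda C_{m i_{q_2}}, \Lambda C_{m i_{q_3}}, \ldots, \Lambda C_{m i_{q_r}}$, $\Lambda C_{j_{q_1} m}, \Lambda C_{j_{q_2} m}, \ldots, \Lambda C_{j_{q_s} m}$,
where $i_{q_1}, i_{q_2}, \ldots, i_{q_r}$ are elements in the collection
$\{i_1, i_2, \ldots, i_k\}$ and $j_{q_1}, j_{q_2}, \ldots, j_{q_s}$ are elements in the
collection $\{j_1, j_2, \ldots, j_p\}$, such that $[x_m,\lambda_m]$ belongs to the
boundary of each of these sets and, say, these include all
such sets. Let $\Lambda'$ denote the set of all points in $\Lambda$ which
are common to the boundaries of the above (r+s) sets and none
else. Then $\Lambda'$ when considered with reference to the above
(r+s) sets is equivalently situated to $\Lambda$ when considered with
respect to the (k+p) sets $C_{m i_1}, \ldots, C_{m i_k}, C_{j_1 m}, \ldots, C_{j_p m}$. We
therefore analyse the points in $\Lambda'$ in exactly the same
manner we analysed the points in $\Lambda$ and write $\Lambda'$ as the
disjoint union

$$\Lambda' = \Lambda' C_{m i_{q_1}} \cup \Lambda' C_{m i_{q_2}} \cup \cdots \cup \Lambda' C_{m i_{q_r}} \cup \Lambda' C_{j_{q_1} m} \cup \Lambda' C_{j_{q_2} m} \cup \cdots \cup \Lambda' C_{j_{q_s} m}.$$

Now if $[x_m,\lambda_m] \in \mathrm{Int}(\Lambda' C_{m i_{q_1}})$ we rewrite the boundary
transversality condition as follows:

$$\Phi_m(t,\sigma_{m-1}) \pitchfork \mathrm{Int}(\Lambda' C_{m i_{q_1}}) \quad \forall\, t > 0.$$

Note that we have considered $\Lambda' C_{m i_{q_1}}$ to be a separate entity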


in itself. We further write

$$\sigma_m = \Phi_m(t_m,\sigma_{m-1}) \cap \mathrm{Int}(\Lambda' C_{m i_{q_1}})$$

and denote $C_{i_{q_1}}$ by $C_{m+1}$ instead. It is also clear that

$$\operatorname{codim} \sigma_m > n+m$$

for $\operatorname{codim} \mathrm{Int}(\Lambda' C_{m i_{q_1}}) > 1$.

If on the other hand $[x_m,\lambda_m] \in (\Lambda' C_{m i_{q_1}} - \mathrm{Int}(\Lambda' C_{m i_{q_1}}))$
we construct a set $\Lambda''$ from $\Lambda'$ in the same way we constructed
$\Lambda'$ starting from $\Lambda$, and so on. In all cases, however, the
inequality

$$\operatorname{codim} \sigma_m > n+m$$

remains valid.


At this stage we would like to mention a somewhat
different situation. Let $\Lambda \subset \partial C_m$ be the set of all points
such that $[x,\lambda] \in \Lambda$ implies that $[x,\lambda]$ belongs to the
boundary of each of the sets $C_{m k_1}, C_{m k_2}, \ldots, C_{m k_p}$,
$C_{k_1 m}, C_{k_2 m}, \ldots, C_{k_p m}$ and none else. For such a set $\Lambda$ it is
possible to find points inside it such that an optimal path
when it reaches this point turns back into the control region
from which it came. Let $\theta \subset \Lambda$ be the set of all points in
$\Lambda$ at which the optimal path turns back into $C_m$. We assume
that $\theta$ is a smooth manifold with boundary. Now suppose
the optimal path from $[x_{m-1},\lambda_{m-1}]$ meets $\partial C_m$ again at the
point $[x_m,\lambda_m] \in \mathrm{Int}(\theta)$. To be consistent with the process
of reasoning developed so far we stipulate that

$$\Phi_m(t,\sigma_{m-1}) \pitchfork \mathrm{Int}(\theta) \quad \forall\, t > 0.$$

This again is the boundary transversality condition and we
write

$$\sigma_m = \Phi_m(t_m,\sigma_{m-1}) \cap \mathrm{Int}(\theta)$$

in which case

$$\operatorname{codim} \sigma_m > n+m.$$

Note that we have taken $\theta$ to be a separate entity. This
follows from the same logical necessity which did prompt us
to consider $\Lambda C_{m i_{q_1}}$ and $\Lambda' C_{m i_{q_1}}$ as separate entities.
Further in this case where the optimal path turns back into
$C_m$, we take $C_{m+1} = C_m$.

We say that the System (1.1) satisfies the transversality
condition if

(i) for every possible m for which $[x_m,\lambda_m] \in \mathrm{Int}(C_{m\,m+1})$
the condition

$$\Phi_m(t,\sigma_{m-1}) \pitchfork \mathrm{Int}(C_{m\,m+1}) \quad \forall\, t > 0$$

holds,
or if

(ii) the boundary transversality condition is satisfied.

We have finally proved the following:
If the System (1.1) satisfies the transversality
condition, the optimal control, as we traverse an optimal path



the set of points optimally reachable from x possesses a
non-empty interior in M, the upper bound on the number of
discontinuities is a number not less than (n-1). In systems
for which there exists an uncountable number of vector fields
to manoeuvre with, we enjoy more freedom in trying to reach
different points in M from some given point. For such
systems it may be possible to have the value of the upper
bound less than (n-1). We, however, in our analysis, have
nowhere made provision which ensures an uncountable number
of vector fields to manoeuvre with, nor does the fact that
the system satisfies the transversality condition ensure it;
in fact the linear system $\dot{x} = Ax + bu$, $-1 \le u \le 1$, for which
there exist just two vector fields to manoeuvre with
(see page 20), satisfies the transversality condition, under
some constraints; we see this in the next section. The upper
bound, therefore, has to be a number greater than or equal to
(n-1).

3-3 Fel'dbaum's Switching Theorem

To illustrate the conclusions obtained in Section 3-2,
here we re-establish the Fel'dbaum's switching theorem
[1, p.120; 2, p.143].

Consider the linear time optimal problem where the
state equations are given by

$$\dot{x} = Ax + bu; \quad -1 \le u \le 1. \qquad (L)$$

The variable $x \in \mathbb{R}^n$; A is an n×n real matrix and b is an n×1
real vector. We assume that the System (L) is controllable,
i.e.,

$$\operatorname{rank}(b, Ab, \ldots, A^{n-1}b) = n.$$
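The rank condition above can be checked numerically for a given pair (A, b). The following is a minimal sketch in plain Python, not part of the original analysis; the helper names `mat_vec`, `rank` and `is_controllable`, the tolerance, and the double-integrator example are all illustrative assumptions.

```python
def mat_vec(A, v):
    """Matrix-vector product A v for a list-of-rows matrix."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def rank(rows, tol=1e-9):
    """Rank of a matrix (given as a list of rows) by Gaussian elimination."""
    M = [list(map(float, r)) for r in rows]
    ncols = len(M[0]) if M else 0
    r = 0
    for c in range(ncols):
        if r == len(M):
            break
        # Partial pivoting: pick the largest entry in column c at or below row r.
        p = max(range(r, len(M)), key=lambda i: abs(M[i][c]))
        if abs(M[p][c]) < tol:
            continue
        M[r], M[p] = M[p], M[r]
        for i in range(r + 1, len(M)):
            f = M[i][c] / M[r][c]
            for j in range(c, ncols):
                M[i][j] -= f * M[r][j]
        r += 1
    return r

def is_controllable(A, b):
    """True when rank(b, Ab, ..., A^{n-1} b) = n."""
    n = len(b)
    vectors, v = [], list(b)
    for _ in range(n):
        vectors.append(v)
        v = mat_vec(A, v)
    # The vectors are stored as rows; row rank equals column rank.
    return rank(vectors) == n

# Hypothetical example: the double integrator is controllable,
# while the identity matrix with a single input direction is not.
double_integrator = is_controllable([[0.0, 1.0], [0.0, 0.0]], [0.0, 1.0])
identity_case = is_controllable([[1.0, 0.0], [0.0, 1.0]], [1.0, 0.0])
```

For larger or badly scaled systems a singular-value based rank test would be more robust than plain elimination; the sketch only aims to make the condition concrete.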

Fel'dbaum's switching theorem for the linear time
optimal problem for (L) asserts: when the eigenvalues of A
are all real the optimal control governing any optimal trajectory
suffers at most (n-1) discontinuities. We therefore prove the
following proposition.

Proposition

Let A have all real eigenvalues. Then (L) necessarily
satisfies the transversality condition, as a consequence of which
the optimal control governing any optimal trajectory does not
suffer more than (n-1) discontinuities.

The following two lemmas are essential to prove the
proposition.

Lemma 1

Consider the homogeneous linear system

$$\dot{x} = Bx, \qquad (*)$$

where $x \in \mathbb{R}^n$ and B is an n×n real matrix. Let $\Phi_B(t,\cdot)$ denote
the one parameter group of diffeomorphisms generated by the
vector field Bx on $\mathbb{R}^n$. We have the following:

(i) If S is a hyperplane given by

$$\langle a, x\rangle = 0$$

where a is an n×1 vector, the diffeomorphic image $\Phi_B(t,S)$
is also a hyperplane given by

$$\langle e^{-B^T t}a, x\rangle = 0.$$

(ii) If $S_1$ and $S_2$ are two hyperplanes in $\mathbb{R}^n$ with $S_1 \pitchfork S_2$
then

$$\Phi_B(t,S_1) \pitchfork \Phi_B(t,S_2) \quad \forall\, t.$$

Proof: (i) Consider the adjoint equation

$$\dot{y} = -B^T y \qquad (**)$$

and let x(t) and y(t) satisfy (*) and (**) respectively. Then
the scalar product

$$\langle x(t), y(t)\rangle = \text{constant}.$$

For,

$$\frac{d}{dt}\langle x(t),y(t)\rangle = \langle \dot{x}(t),y(t)\rangle + \langle x(t),\dot{y}(t)\rangle = \langle Bx(t),y(t)\rangle - \langle x(t),B^T y(t)\rangle = 0.$$

Solving Equation (**) with y(0) = a we obtain

$$y(t) = e^{-B^T t}a.$$

Now as $\langle a,x\rangle = 0$ for all $x \in S$, it follows that

$$\langle e^{-B^T t}a, x\rangle = 0$$

for all $x \in \Phi_B(t,S)$.
(ii) Since $\Phi_B(t,S_1)$ and $\Phi_B(t,S_2)$ are hyperplanes they must
either coincide or be transversal. However, if they do
coincide, the fact that (*) has a unique solution through any
initial point will be violated and hence $\Phi_B(t,S_1)$ and $\Phi_B(t,S_2)$
are necessarily transversal.

Lemma 2

Let A have all real eigenvalues. For any choice of
m ≤ n distinct times $t_1, t_2, \ldots, t_m$ and integers $m_j > 0$
with $\sum_{j=1}^{m} m_j = n$, the n vectors $e^{At_j}A^{k-1}b$ ($1 \le k \le m_j$, $1 \le j \le m$)
are linearly independent.

See [15] for a proof.

Proof of the Proposition:

For the System (L) the state space $M = \mathbb{R}^n$
and for $x \in M$, $T_x^*M = \mathbb{R}^n$. The co-state variable $\lambda$ satisfies
the equation

$$\dot{\lambda} = -A^T\lambda. \qquad (L')$$

The Hamiltonian is given by

$$H = \langle\lambda,Ax\rangle + \langle\lambda,b\rangle u.$$

Z, the set of all points in $T^*M$ through which there exists
an optimal path, is given by

$$Z = \{[x,\lambda] \in T^*M : \lambda \ne 0 \text{ and } \langle\lambda,Ax\rangle + |\langle\lambda,b\rangle| \ge 0\}.$$

The optimal control is bang-bang and takes values +1 and -1
alternately. Z is therefore partitioned into two control
regions $C_1$ and $C_2$ where the optimal control remains constant
at the values +1 and -1 respectively. In the relative topology
of Z, $C_1$ and $C_2$ have a common boundary given by

$$\Lambda_1 = \{[x,\lambda] \in Z : \langle b,\lambda\rangle = 0\}.$$

Further

$$\mathrm{Int}(C_1) = \{[x,\lambda] \in Z : \langle b,\lambda\rangle > 0\}$$

$$\mathrm{Int}(C_2) = \{[x,\lambda] \in Z : \langle b,\lambda\rangle < 0\}.$$

We take a point $[x_0,\lambda_0] \in \mathrm{Int}(C_1)$ and study the number
of switchings that the optimal control suffers as we
traverse the optimal path through $[x_0,\lambda_0]$.

The portion of an optimal path lying in $C_1$ is nothing
but an integral curve of the system of equations

$$\frac{dx}{dt} = Ax + b \qquad (L_+)$$

$$\frac{d\lambda}{dt} = -A^T\lambda. \qquad (L')$$

Let $X_+$ be the vector field defined on $T^*M$ by the rule

$$X_+([x,\lambda]) = \begin{bmatrix} Ax + b \\ -A^T\lambda \end{bmatrix}$$

and let $\Phi_+(t,\cdot)$ be the one parameter group of diffeomorphisms
generated by $X_+$. Similarly the portion of an optimal path
lying in $C_2$ is nothing but an integral curve of the system
of equations

$$\frac{dx}{dt} = Ax - b \qquad (L_-)$$

$$\frac{d\lambda}{dt} = -A^T\lambda. \qquad (L')$$

Let $X_-$ be the corresponding vector field defined on $T^*M$ and
$\Phi_-(t,\cdot)$ be the one parameter group of diffeomorphisms generated
by $X_-$.

The optimal path from $[x_0,\lambda_0]$ first crosses over
(if at all it crosses over) from $C_1$ to $C_2$, then from $C_2$ to
$C_1$ and so on.

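Let $[x,\lambda] \in \Lambda_1$. To see which way the optimal path
crosses at $[x,\lambda]$ we have to examine the Lie derivative
$L_{X_+}H_-$ or equivalently $L_{X_-}H_+$ at this point, where

$$H_+ = \langle\lambda,Ax\rangle + \langle\lambda,b\rangle \quad\text{and}\quad H_- = \langle\lambda,Ax\rangle - \langle\lambda,b\rangle.$$

Now

$$L_{X_+}H_- = \left\langle \frac{\partial H_-}{\partial x}, \frac{dx}{dt}\right\rangle + \left\langle \frac{\partial H_-}{\partial\lambda}, \frac{d\lambda}{dt}\right\rangle.$$

Here $dx/dt$ and $d\lambda/dt$ are along the integral curve of $X_+$. We
therefore have

$$\frac{dx}{dt} = Ax + b \quad\text{and}\quad \frac{d\lambda}{dt} = -A^T\lambda.$$

Hence

$$L_{X_+}H_- = \langle A^T\lambda, Ax+b\rangle + \langle Ax-b, -A^T\lambda\rangle = 2\langle\lambda,Ab\rangle.$$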
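The identity $L_{X_+}H_- = 2\langle\lambda,Ab\rangle$ can be spot-checked at an arbitrary point by evaluating the two gradient pairings directly. This is a small illustrative sketch; the function names and the numerical values of A, b, x and $\lambda$ are assumptions chosen only for the example.

```python
def mat_vec(A, v):
    """A v."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def mat_T_vec(A, v):
    """A^T v."""
    n = len(A)
    return [sum(A[i][j] * v[i] for i in range(n)) for j in range(len(A[0]))]

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def lie_Xplus_Hminus(A, b, x, lam):
    """<dH_-/dx, dx/dt> + <dH_-/dlam, dlam/dt> along the field X_+."""
    Ax = mat_vec(A, x)
    ATlam = mat_T_vec(A, lam)
    grad_x = ATlam                                  # dH_-/dx = A^T lam
    grad_lam = [p - q for p, q in zip(Ax, b)]       # dH_-/dlam = Ax - b
    x_dot = [p + q for p, q in zip(Ax, b)]          # dx/dt = Ax + b
    lam_dot = [-v for v in ATlam]                   # dlam/dt = -A^T lam
    return dot(grad_x, x_dot) + dot(grad_lam, lam_dot)

A = [[0.0, 1.0], [-2.0, -3.0]]
b = [0.0, 1.0]
x = [0.3, -1.2]
lam = [0.7, 0.4]
lhs = lie_Xplus_Hminus(A, b, x, lam)
rhs = 2.0 * dot(lam, mat_vec(A, b))
```

The terms involving x cancel, which is why the sign of the Lie derivative depends on the co-state alone.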



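Similarly

$$L_{-X_+}H_- = -2\langle\lambda,Ab\rangle = -L_{-X_-}H_+$$

where

$$-X_+([x,\lambda]) = \begin{bmatrix} -Ax - b \\ A^T\lambda \end{bmatrix} \quad\text{and}\quad -X_-([x,\lambda]) = \begin{bmatrix} -Ax + b \\ A^T\lambda \end{bmatrix}.$$

From the condition of controllability of (L) it follows
that the two vectors

$$\begin{bmatrix} 0 \\ b \end{bmatrix} \in \mathbb{R}^{2n} \quad\text{and}\quad \begin{bmatrix} 0 \\ Ab \end{bmatrix} \in \mathbb{R}^{2n}$$

(the first n components of each being zero) are linearly independent. The hyperplanes

$$\langle\lambda,b\rangle = 0 \quad\text{and}\quad \langle\lambda,Ab\rangle = 0$$

therefore intersect transversally. Let $[x',\lambda']$ be a point
in $\Lambda_1$ such that $\langle\lambda',Ab\rangle > 0$. At this point we then have

$$L_{X_-}H_+ < 0 < L_{X_+}H_- \qquad (*)$$

and

$$L_{-X_+}H_- < 0 < L_{-X_-}H_+. \qquad (**)$$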
From the Relation (*) it follows that $H_+$ decreases as we
traverse the integral curve of $X_-$ from the point $[x',\lambda']$
onwards and that $H_-$ increases as we traverse the integral
curve of $X_+$ from the same point, $[x',\lambda']$, onwards. Maximum
principle therefore dictates that the optimal path from the
point $[x',\lambda']$ onwards and close to it is along the integral
curve of the vector field $X_-$. Similarly it follows from
Relation (**) that we reach the point $[x',\lambda']$ along the
optimal path which is nothing but an integral curve of the
vector field $X_+$. In other words the optimal path crosses
over from the control region $C_1$ into the control region $C_2$
at $[x',\lambda']$. We therefore have the following:

In the relative topology of $\Lambda_1$

$$\mathrm{Int}(C_{12}) = \{[x,\lambda] \in \Lambda_1 : \langle\lambda,Ab\rangle > 0\}$$

$$\mathrm{Int}(C_{21}) = \{[x,\lambda] \in \Lambda_1 : \langle\lambda,Ab\rangle < 0\}$$

and these two sets have the common boundary

$$\Lambda_2 = \{[x,\lambda] \in \Lambda_1 : \langle\lambda,Ab\rangle = 0\}.$$

What happens to an optimal path when it reaches a point
belonging to $\Lambda_2$ can be ascertained by examining the higher
order Lie derivatives as said before. These derivatives,
as can be easily verified, are given by

$$L^k_{X_+}H_- = (-1)^{k+1}\,2\langle\lambda,A^k b\rangle = -L^k_{X_-}H_+$$

and

$$L^k_{-X_+}H_- = -2\langle\lambda,A^k b\rangle = -L^k_{-X_-}H_+$$

where $L^k_X H = L_X(L^{k-1}_X H)$.

The hyperplanes

$$\langle\lambda,b\rangle = 0, \quad \langle\lambda,Ab\rangle = 0 \quad\text{and}\quad \langle\lambda,A^2b\rangle = 0$$

intersect transversally (thanks to the controllability of (L)).
We therefore write the following:

$$F_1 = \{[x,\lambda] \in \Lambda_2 : \langle\lambda,A^2b\rangle > 0\}$$

$$G_1 = \{[x,\lambda] \in \Lambda_2 : \langle\lambda,A^2b\rangle < 0\}$$

and

$$\Lambda_3 = \{[x,\lambda] \in \Lambda_2 : \langle\lambda,A^2b\rangle = 0\}.$$

Clearly $\Lambda_2$ is the disjoint union

$$\Lambda_2 = F_1 \cup G_1 \cup \Lambda_3.$$

Let $[x_a,\lambda_a] \in F_1$. Then at $[x_a,\lambda_a]$ we have

$$L_{X_+}H_- = 0 = L_{X_-}H_+ \quad\text{and}\quad L^2_{X_+}H_- < 0 < L^2_{X_-}H_+.$$

A little consideration now shows that we reach the point
$[x_a,\lambda_a]$ along the optimal path which is nothing but an
integral curve of the vector field $X_+$ and that from the
point $[x_a,\lambda_a]$ onwards the optimal path continues to be
along the integral curve of $X_+$. In other words at $[x_a,\lambda_a]$
the optimal path turns back into the same region, viz.
$\mathrm{Int}(C_1)$, from which it came and this is true for all points
in $F_1$. Similarly at all points in $G_1$ the optimal path turns
back into the region $\mathrm{Int}(C_2)$ from which it came. To see which
way the optimal path turns when it reaches a point belonging
to $\Lambda_3$ we have to partition $\Lambda_3$ in the same way as we
partitioned $\Lambda_2$. If we continue this process we obtain a
partition of $\Lambda_2$ in the sense that $\Lambda_2$ can be expressed as
the disjoint union

$$\Lambda_2 = F_1 \cup G_1 \cup F_2 \cup G_2 \cup \cdots \cup F_{n-2} \cup G_{n-2}$$

where

$$F_k = \{[x,\lambda] \in \Lambda_2 : \langle\lambda,A^2b\rangle = \langle\lambda,A^3b\rangle = \cdots = \langle\lambda,A^kb\rangle = 0 \text{ and } \langle\lambda,A^{k+1}b\rangle > 0\}$$

and

$$G_k = \{[x,\lambda] \in \Lambda_2 : \langle\lambda,A^2b\rangle = \langle\lambda,A^3b\rangle = \cdots = \langle\lambda,A^kb\rangle = 0 \text{ and } \langle\lambda,A^{k+1}b\rangle < 0\}.$$

At a point $[x,\lambda] \in F_k$ we have

$$L_{X_+}H_- = L^2_{X_+}H_- = \cdots = L^k_{X_+}H_- = 0 = L_{X_-}H_+ = L^2_{X_-}H_+ = \cdots = L^k_{X_-}H_+,$$

$$L_{-X_+}H_- = L^2_{-X_+}H_- = \cdots = L^k_{-X_+}H_- = 0 = L_{-X_-}H_+ = L^2_{-X_-}H_+ = \cdots = L^k_{-X_-}H_+$$

and

$$L^{k+1}_{X_+}H_- < 0 < L^{k+1}_{X_-}H_+ \quad\text{if k is odd,}$$

$$L^{k+1}_{X_-}H_+ < 0 < L^{k+1}_{X_+}H_- \quad\text{if k is even}$$

and finally

$$L^{k+1}_{-X_+}H_- < 0 < L^{k+1}_{-X_-}H_+.$$

Hence if k is odd the optimal path turns back into the region
$\mathrm{Int}(C_1)$ from which it came at all points $[x,\lambda] \in F_k$. On the
other hand the optimal path crosses over from $\mathrm{Int}(C_1)$ into
$\mathrm{Int}(C_2)$ at all $[x,\lambda] \in F_k$ if k is even. Similarly at points
$[x,\lambda] \in G_k$ the optimal path turns back into $\mathrm{Int}(C_2)$ from which
it came if k is odd whereas for even k it crosses over from
$\mathrm{Int}(C_2)$ into $\mathrm{Int}(C_1)$ at points $[x,\lambda] \in G_k$.
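The case analysis above amounts to finding the first non-vanishing derivative of the switching function $s(t) = \langle\lambda(t),b\rangle$ at a point where $s = 0$: since $\dot\lambda = -A^T\lambda$, the j-th derivative equals $(-1)^j\langle\lambda,A^jb\rangle$. A hedged sketch of that classification follows; the function `classify`, its return strings, the tolerance and the triple-integrator example are illustrative assumptions, not part of the original text.

```python
def mat_vec(A, v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def dot(u, v):
    return sum(p * q for p, q in zip(u, v))

def classify(A, b, lam, tol=1e-12):
    """Behaviour of the optimal path at a co-state lam with <lam, b> = 0.

    Uses the first j >= 1 with <lam, A^j b> != 0; the j-th derivative of
    s(t) = <lam(t), b> at that point is (-1)^j <lam, A^j b>.
    """
    if abs(dot(lam, b)) > tol:
        raise ValueError("lam is not on the switching surface <lam, b> = 0")
    v = list(b)
    for j in range(1, len(b)):
        v = mat_vec(A, v)                 # v = A^j b
        c = dot(lam, v)
        if abs(c) > tol:
            deriv = (-1) ** j * c
            if j % 2 == 1:                # odd order: s changes sign
                return "crosses C1 -> C2" if deriv < 0 else "crosses C2 -> C1"
            # even order: s touches zero and keeps its sign
            return "turns back into C1" if deriv > 0 else "turns back into C2"
    return "undetermined"

# Hypothetical example: triple integrator (all eigenvalues zero, hence real).
A = [[0.0, 1.0, 0.0], [0.0, 0.0, 1.0], [0.0, 0.0, 0.0]]
b = [0.0, 0.0, 1.0]
on_C12 = classify(A, b, [0.0, 1.0, 0.0])    # <lam, Ab> > 0
on_F1 = classify(A, b, [1.0, 0.0, 0.0])     # <lam, Ab> = 0, <lam, A^2 b> > 0
on_G1 = classify(A, b, [-1.0, 0.0, 0.0])    # <lam, Ab> = 0, <lam, A^2 b> < 0
```

In the notation of the text, j = 1 corresponds to the sets $\mathrm{Int}(C_{12})$ and $\mathrm{Int}(C_{21})$, and j = k+1 corresponds to $F_k$ and $G_k$.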

Now the switchings encountered along an optimal path of
the System (L) are all discontinuous switchings. The fact that
(L) satisfies the transversality condition will therefore be
established in the following two steps.

Step 1. First we suppose that every time the optimal path from
$[x_0,\lambda_0]$ crosses over from $C_1$ to $C_2$ or from $C_2$ to $C_1$, it does
so at points belonging to $\mathrm{Int}(C_{12})$ or $\mathrm{Int}(C_{21})$ (interior in
the relative topology of $\Lambda_1$) respectively and show that (L)
satisfies the transversality condition I.

Step 2. Next we allow the optimal path to pass through points
belonging to $\Lambda_2$ and show that (L) satisfies the boundary
transversality condition.

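Before proceeding along the lines indicated above we note
that the state and co-state equations are decoupled everywhere,
i.e., the Equations (L') are decoupled from the Equations $(L_+)$
and also from the Equations $(L_-)$. Let $S = C_1 \cap M^*_{x_0}$ and let
$\sigma_0$ be a neighbourhood of $[x_0,\lambda_0]$ in the relative topology of
$M^*_{x_0}$ such that $\sigma_0 \subset S$. Consider the set $\Phi_+(t_1,\sigma_0)$. Let p be
any point in this set; then $\pi(p) = x_1$ where $x_1 \in M$ is that point
which is reached in time $t_1$ from the point $x_0$, traversing the
trajectory of the system of Equations $(L_+)$ defined on M. Hence

$$\Phi_+(t_1,\sigma_0) \subset M^*_{x_1}.$$

We can think of the set $\Phi_+(t_1,\sigma_0)$ in the following alternative
way. Consider the co-state space $M^*_{x_0}$. Let X' be a vector field
defined on $M^*_{x_0}$ by the rule $X'(\lambda) = -A^T\lambda$ and let $\Phi'(t,\cdot)$ denote
the corresponding one parameter group of diffeomorphisms. Now
$\sigma_0 \subset M^*_{x_0}$; construct the set $\Phi'(t_1,\sigma_0)$.
$\Phi'(t_1,\sigma_0)$ considered as a subset of $M^*_{x_0}$ is equivalent in all
respects to $\Phi_+(t_1,\sigma_0)$ considered as a subset of $M^*_{x_1}$. Let $\Lambda'_1$
be the subset of $M^*_{x_0}$ defined by

$$\Lambda'_1 = \{\lambda \in M^*_{x_0} : \langle\lambda,b\rangle = 0\}.$$

Then

$$\Phi_+(t_1,\sigma_0) \pitchfork \Lambda_1 \quad\text{if}\quad \Phi'(t_1,\sigma_0) \pitchfork \Lambda'_1 \quad (\text{transversal as subsets of } M^*_{x_0}).$$

Let us write

$$\sigma_1 = \Phi_+(t_1,\sigma_0) \cap \Lambda_1, \qquad \sigma'_1 = \Phi'(t_1,\sigma_0) \cap \Lambda'_1$$

and, say, $\sigma_1$ and $\sigma'_1$ are nonempty. $\sigma_1$ considered as a subset
of $M^*_{x_1}$ is equivalent in all respects to $\sigma'_1$ considered as a
subset of $M^*_{x_0}$. Construct the set $\Phi_-(t_2,\sigma_1)$. For every point
$p \in \Phi_-(t_2,\sigma_1)$ we have $\pi(p) = x_2$ where $x_2$ is that point in M
which is reached in time $t_2$, from the point $x_1$, traversing the
trajectory of the system of equations $(L_-)$ defined on M. We
therefore have

$$\Phi_-(t_2,\sigma_1) \subset M^*_{x_2}.$$

Again as before $\Phi_-(t_2,\sigma_1)$ considered as a subset of $M^*_{x_2}$ is
equivalent in all respects to $\Phi'(t_2,\sigma'_1)$ considered as a subset
of $M^*_{x_0}$; furthermore

$$\Phi_-(t_2,\sigma_1) \pitchfork \Lambda_1 \quad\text{if}\quad \Phi'(t_2,\sigma'_1) \pitchfork \Lambda'_1 \quad (\text{transversal as subsets of } M^*_{x_0})$$

and so on.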

We therefore consider the following equivalent problem.

In the n-dimensional space $\mathbb{R}^n$ of the co-state vector $\lambda$
consider the following subsets

$$\Lambda'_1 = \{\lambda \in \mathbb{R}^n : \langle\lambda,b\rangle = 0\}$$

$$\mathrm{Int}(C'_1) = \{\lambda \in \mathbb{R}^n : \langle\lambda,b\rangle > 0\}$$

$$\mathrm{Int}(C'_2) = \{\lambda \in \mathbb{R}^n : \langle\lambda,b\rangle < 0\}.$$

We have, no doubt, created some confusion by defining $\mathrm{Int}(C'_1)$
and $\mathrm{Int}(C'_2)$ without defining the sets $C'_1$ and $C'_2$ a priori. The
two sets as defined are however open subsets of $\mathbb{R}^n$; we could
have denoted them by any other symbol, but have chosen the
symbols $\mathrm{Int}(C'_1)$ and $\mathrm{Int}(C'_2)$ to emphasize the fact that the
manner in which they are related to the equivalent problem is
similar to the manner in which the sets $\mathrm{Int}(C_1)$ and $\mathrm{Int}(C_2)$ are
related to the original problem. $\mathrm{Int}(C'_1)$ and $\mathrm{Int}(C'_2)$ have as
their common boundary the set $\Lambda'_1$.

We also write

$$\Lambda'_2 = \{\lambda \in \Lambda'_1 : \langle\lambda,Ab\rangle = 0\}$$

$$\mathrm{Int}(C'_{12}) = \{\lambda \in \Lambda'_1 : \langle\lambda,Ab\rangle > 0\}$$

$$\mathrm{Int}(C'_{21}) = \{\lambda \in \Lambda'_1 : \langle\lambda,Ab\rangle < 0\}.$$

$\mathrm{Int}(C'_{12})$ and $\mathrm{Int}(C'_{21})$ are (n-1)-dimensional submanifolds in $\mathbb{R}^n$
and have $\Lambda'_2$ as their common topological boundary in the relative
topology of $\Lambda'_1$.

Let X' be the vector field defined on $\mathbb{R}^n$ by the rule

$$X'(\lambda) = -A^T\lambda$$

and let $\Phi'(t,\cdot)$ be the one parameter group of diffeomorphisms
generated by X'.

If $\lambda \in \mathrm{Int}(C'_{12})$ then

$$\langle -A^T\lambda, b\rangle = -\langle\lambda,Ab\rangle < 0.$$

This implies that the integral curve of X' through $\lambda$ crosses
over from $\mathrm{Int}(C'_1)$ to $\mathrm{Int}(C'_2)$ at $\lambda$. Similarly the integral
curve of X' crosses over from $\mathrm{Int}(C'_2)$ into $\mathrm{Int}(C'_1)$ at points
belonging to $\mathrm{Int}(C'_{21})$. At points $\lambda \in \Lambda'_2$ we have $\langle -A^T\lambda,b\rangle = 0$.
To see which way the integral curve of X' turns at these points
we first observe that along this integral curve


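$$\frac{d^k\lambda}{dt^k} = (-1)^k (A^T)^k\lambda \quad\text{and}\quad \frac{d^k\lambda}{d(-t)^k} = (A^T)^k\lambda$$

$$\Longrightarrow\quad \left\langle\frac{d^k\lambda}{dt^k},b\right\rangle = (-1)^k\langle\lambda,A^kb\rangle \quad\text{and}\quad \left\langle\frac{d^k\lambda}{d(-t)^k},b\right\rangle = \langle\lambda,A^kb\rangle.$$

Next, keeping in mind the way $\Lambda_2$ was partitioned, we write $\Lambda'_2$
as the disjoint union

$$\Lambda'_2 = F'_1 \cup G'_1 \cup F'_2 \cup G'_2 \cup \cdots \cup F'_{n-2} \cup G'_{n-2}$$

where

$$F'_k = \{\lambda \in \Lambda'_2 : \langle\lambda,A^2b\rangle = \langle\lambda,A^3b\rangle = \cdots = \langle\lambda,A^kb\rangle = 0 \text{ and } \langle\lambda,A^{k+1}b\rangle > 0\}$$

and

$$G'_k = \{\lambda \in \Lambda'_2 : \langle\lambda,A^2b\rangle = \langle\lambda,A^3b\rangle = \cdots = \langle\lambda,A^kb\rangle = 0 \text{ and } \langle\lambda,A^{k+1}b\rangle < 0\}.$$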


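Now consider a point $\lambda \in F'_k$. Then at $\lambda$ we have

$$\left\langle\frac{d^i\lambda}{dt^i},b\right\rangle = \left\langle\frac{d^i\lambda}{d(-t)^i},b\right\rangle = 0 \quad\text{for } i = 1,2,\ldots,k,$$

$$\left\langle\frac{d^{k+1}\lambda}{dt^{k+1}},b\right\rangle > 0 \text{ if k is odd}, \qquad < 0 \text{ if k is even}$$

and

$$\left\langle\frac{d^{k+1}\lambda}{d(-t)^{k+1}},b\right\rangle > 0.$$

It therefore follows that at points in $F'_k$ the integral curve
of X' turns back into $\mathrm{Int}(C'_1)$ from which it came if k is odd
and crosses over into $\mathrm{Int}(C'_2)$ from $\mathrm{Int}(C'_1)$ if k is even.
Similarly at points in $G'_k$ the integral curve of X' turns back
into $\mathrm{Int}(C'_2)$ from which it came if k is odd and crosses over
from $\mathrm{Int}(C'_2)$ into $\mathrm{Int}(C'_1)$ if k is even.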


Step 1. Recall that the point $[x_0,\lambda_0] \in \mathrm{Int}(C_1)$; this implies
$\lambda_0 \in \mathrm{Int}(C'_1)$. We first assume that the integral curve of X'
through $\lambda_0$ never meets $\Lambda'_2$; in other words we establish
transversality condition I as a first step towards establishing
the transversality condition. Let $\sigma'_0$ be a neighbourhood of $\lambda_0$
such that $\sigma'_0 \subset \mathrm{Int}(C'_1)$ and let it be small enough such that
its size does not hinder the arguments that follow.

We start from $\lambda_0$ and traverse the integral curve of X'.
Let this integral curve meet $\Lambda'_1$ for the first time in time
$t_1 > 0$. If no finite $t_1$ exists then the transversality
condition is trivially satisfied and the optimal control as
we traverse the optimal path (of the System (L)) from $[x_0,\lambda_0]$
onwards suffers no switchings. This immediately implies that
along the optimal trajectory from $x_0 \in M$, which arises as the
projection of this optimal path, the optimal control suffers
no discontinuity. This is trivial. We therefore assume the
existence of a finite $t_1 > 0$. Let

$$\lambda_1 = \Phi'(t_1,\lambda_0) \in \Lambda'_1; \quad \lambda_1 \ne 0 \text{ since } \lambda_0 \ne 0.$$

Evidently $\lambda_1 \in \mathrm{Int}(C'_{12})$ and the integral curve of X' crosses
over from $\mathrm{Int}(C'_1)$ into $\mathrm{Int}(C'_2)$ at the point $\lambda_1$. Since $\sigma'_0$ is
an n-dimensional submanifold of $\mathbb{R}^n$ we have

$$\Phi'(t,\sigma'_0) \pitchfork \mathrm{Int}(C'_{12}) \quad \forall\, t > 0. \qquad (3.5)$$

Consider the intersection

$$\sigma'_1 = \Phi'(t_1,\sigma'_0) \cap \mathrm{Int}(C'_{12}).$$

$\sigma'_1$ is non-empty as $\lambda_1 \in \sigma'_1$ and it follows from (3.5) that it
is a submanifold of $\mathbb{R}^n$ with

$$\operatorname{codim} \sigma'_1 = 1.$$

Further $\sigma'_1$ is an open subset of $\Lambda'_1$. (For two subsets A and B
in $\mathbb{R}^n$, we say that A is an open subset of B if $A \subset B$ and is
open in the relative topology of B.)

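Let the integral curve of X' starting from $\lambda_1$ meet
$\Lambda'_1$ again in time $t_2 > 0$. Non-existence of a positive finite
$t_2$ is again trivial. In other words starting from $\lambda_0$ and
traversing the integral curve of X' we meet $\Lambda'_1$ for the second
time at $\lambda_2$ in time $t_1+t_2$:

$$\lambda_2 = \Phi'(t_2,\lambda_1) = \Phi'(t_1+t_2,\lambda_0) \in \Lambda'_1.$$

Evidently $\lambda_2 \in \mathrm{Int}(C'_{21})$ and the integral curve of X' crosses
over from $\mathrm{Int}(C'_2)$ into $\mathrm{Int}(C'_1)$ at $\lambda_2$. For t > 0, $\Phi'(t,\Lambda'_1)$
is the hyperplane given by the equation

$$\langle e^{At}b,\lambda\rangle = 0.$$

Since the vectors $e^{At}b$ and b are linearly independent (Lemma 2)
we have

$$\Phi'(t,\Lambda'_1) \pitchfork \Lambda'_1 \quad \forall\, t > 0.$$

Since $\sigma'_1$ is an open subset of $\Lambda'_1$ we have

$$\Phi'(t,\sigma'_1) \pitchfork \Lambda'_1 \quad \forall\, t > 0.$$

Again since $\mathrm{Int}(C'_{21})$ is an open subset of $\Lambda'_1$ we have

$$\Phi'(t,\sigma'_1) \pitchfork \mathrm{Int}(C'_{21}) \quad \forall\, t > 0. \qquad (3.6)$$

Consider the intersection

$$\sigma'_2 = \Phi'(t_2,\sigma'_1) \cap \mathrm{Int}(C'_{21}).$$

$\sigma'_2$ is non-empty as $\lambda_2 \in \sigma'_2$ and it follows from (3.6) that
it is a submanifold of $\mathbb{R}^n$ with

$$\operatorname{codim} \sigma'_2 = 2.$$

Further $\sigma'_2$ is an open subset of $\Phi'(t_2,\Lambda'_1) \cap \Lambda'_1$.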

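We will continue to have similar situations every
time the integral curve of X' through $\lambda_0$ meets $\Lambda'_1$. For,
suppose the above situation holds at the (m-1)th stage (we
take m to be odd and less than or equal to n-1),
i.e., the integral curve of X' from $\lambda_0$ meets $\Lambda'_1$ for the
(m-1)th time at $\lambda_{m-1}$ in the manner

$$\lambda_{m-1} = \Phi'(t_1+t_2+\cdots+t_{m-1},\lambda_0) \in \Lambda'_1, \quad t_1 > 0,\ t_2 > 0,\ \ldots,\ t_{m-1} > 0,$$

$$\lambda_{m-1} \in \mathrm{Int}(C'_{21})$$

(recall that m is odd), the integral curve of X' crosses over
from $\mathrm{Int}(C'_2)$ into $\mathrm{Int}(C'_1)$ at $\lambda_{m-1}$,

$$\Phi'(t,\sigma'_{m-2}) \pitchfork \mathrm{Int}(C'_{21}) \quad \forall\, t > 0,$$

$$\sigma'_{m-1} = \Phi'(t_{m-1},\sigma'_{m-2}) \cap \mathrm{Int}(C'_{21}),$$

$$\lambda_{m-1} \in \sigma'_{m-1},$$

$\sigma'_{m-1}$ is a submanifold of $\mathbb{R}^n$ with

$$\operatorname{codim} \sigma'_{m-1} = m-1,$$

and finally $\sigma'_{m-1}$ is an open subset of

$$\Phi'(t_2+t_3+\cdots+t_{m-1},\Lambda'_1) \cap \Phi'(t_3+t_4+\cdots+t_{m-1},\Lambda'_1) \cap \cdots \cap \Phi'(t_{m-1},\Lambda'_1) \cap \Lambda'_1.$$

We show that a similar situation holds in the mth stage.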

Let the integral curve of X' starting from X ^ 

m-l 

moot A' again in time > o at the point X^^^, If no finite, 
tj^ exists then the transversality condition I is satisfied and 
the optimal control as we traverse the optimal path of the 
System (l) from the point [xq/X^J suffers (m-»l) discontinuous 
switchings only* This immediately implies that along the 
optimal trajectory through 3?^, which arises as the projection 
of this optimal path, the optimal control suffers (m-1) 
discontinuities only* This is trivial, we therefore, assume 
the existence of a finite > 0 * We have 

Evidently Xj^ e Int(Cj|^2) 'the integral curve of X' crosses 
over from Int(Gj) into Int(Cp at X^^j* Now, for t > O the m 
hyperplanes 


S?^(t2H"t3 +• • •+ t^^j^+t, , 

$*(t3+t^ +•••+ tj^^j^+t, A£) jj 



§* ( t, a£) , 




i 
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are represented by the following m equations 


A( trt+to+« • •+t •! +t) 

<e ^ 2 b, X> 

A( to+t^t. . .+t - +t) 

<e ^ 4 . 


0 , 

0, 


in the same order 


» 

$ 


X> = 0, 

X > a= 0* 

Now from lemma 2 it follows that the m vectors 


A( t2+t3+. • ,+tjjj_,]_+t) 
0 

e 






b 

are linearly independent# We therefore have 

[#''(t2+t3+...+tjj^^3^+t, Ap n Aj[) 

n ,.,0 A()]^ A{ > 0. 

However^ subset of 

$'(t2+t3+.,.+tjj^.l+t, A£) n i^tt3+t4+v*+tn^^+t^ A{) 
(\ ... 0 f'(t,A{). 


Hence 




V t > 0 
Again, since $\mathrm{Int}(C'_{12})$ is an open subset of $A'_1$, we have

$\Phi'(t, a'_{m-1}) \pitchfork \mathrm{Int}(C'_{12}) \quad \forall\, t > 0$. (3.7)

Consider the intersection

$a'_m = \Phi'(t_m, a'_{m-1}) \cap \mathrm{Int}(C'_{12})$.

It is non-empty as $\lambda_m \in a'_m$, and it follows from (3.7) that it is a submanifold with

codim $a'_m = m$.

Further, $a'_m$ is an open subset of

$\Phi'(t_2+t_3+\dots+t_m, A'_1) \cap \Phi'(t_3+t_4+\dots+t_m, A'_1) \cap \dots \cap \Phi'(t_m, A'_1) \cap A'_1$.

Note: If m is taken to be an even integer the analysis remains essentially the same, except for the fact that at $\lambda_m$ the integral curve of $X'$ crosses over from $\mathrm{Int}(C'_2)$ into $\mathrm{Int}(C'_1)$.

We now show that the integral curve of the vector field $X'$ starting from $\lambda_0$ cannot meet $A'_1$ more than (n-1) times. For, suppose it meets $A'_1$ for the nth time at $\lambda_n$ in the manner

$\lambda_n = \Phi'(t_1+t_2+t_3+\dots+t_n, \lambda_0) \in A'_1$,

$t_1 > 0,\ t_2 > 0,\ \dots,\ t_n > 0$.

From the preceding analysis it is clear that $\lambda_n$ belongs to the intersection

$\Phi'(t_2+t_3+\dots+t_n, A'_1) \cap \Phi'(t_3+t_4+\dots+t_n, A'_1) \cap \dots \cap \Phi'(t_n, A'_1) \cap A'_1$.

However, as the n vectors

$e^{A(t_2+t_3+\dots+t_n)} b,\ e^{A(t_3+t_4+\dots+t_n)} b,\ \dots,\ e^{A t_n} b,\ b$

are linearly independent, the above intersection contains the single vector 0. But $\lambda_n \neq 0$; we therefore reach a contradiction. Hence the integral curve of $X'$ through $\lambda_0$ meets $A'_1$ at most (n-1) times.
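For illustration, the smallest case n = 2 can be worked out by hand. The data below (a diagonal $A$ with distinct real entries $a_1 \neq a_2$, and $b = (1,1)^{T}$) are an assumed example, not taken from the text, and the hyperplanes are again assumed to have the normals $e^{As}b$.

```latex
\begin{align*}
A &= \begin{pmatrix} a_1 & 0 \\ 0 & a_2 \end{pmatrix}, \qquad
b = \begin{pmatrix} 1 \\ 1 \end{pmatrix}, \qquad a_1 \neq a_2, \\
\det\begin{pmatrix} e^{A t_2} b & b \end{pmatrix}
  &= \det\begin{pmatrix} e^{a_1 t_2} & 1 \\ e^{a_2 t_2} & 1 \end{pmatrix}
   = e^{a_1 t_2} - e^{a_2 t_2} \neq 0 \qquad (t_2 > 0), \\
\langle e^{A t_2} b, \lambda \rangle = 0,\quad \langle b, \lambda \rangle = 0
  &\implies \lambda = 0 .
\end{align*}
```

In this example a nonzero co-state cannot lie on the switching hyperplane at two instants separated by a time $t_2 > 0$, so at most n-1 = 1 switching can occur, in agreement with the bound just obtained.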

Now, recall that the starting point $\lambda_0$ belongs to $\mathrm{Int}(C'_1)$. If $[x_0, \lambda_0]$ were taken to be a point inside $\mathrm{Int}(C_2)$, $\lambda_0$ would belong to $\mathrm{Int}(C'_2)$ instead. This, however, induces little change in the analysis just concluded. On the other hand, if $[x_0, \lambda_0]$ were taken to be a point on $A_1$, $\lambda_0$ would belong to $A'_1$. In such cases, as indicated before, we traverse the optimal path from $[x_0, \lambda_0]$ for a negative time until we reach a point belonging to the interior of some control region and take it as our starting point. This is equivalent to traversing the integral curve of $X'$ through $\lambda_0 \in A'_1$ for a negative duration until we reach a point, say $\lambda'$, belonging to either $\mathrm{Int}(C'_1)$ or $\mathrm{Int}(C'_2)$, and taking this $\lambda'$ as our starting point. The integral curve of $X'$ from $\lambda'$ onwards does not meet $A'_1$ more than (n-1) times ==> the integral curve of $X'$ from $\lambda_0$ onwards does not meet $A'_1$ more than (n-2) times. These facts in turn imply the following:
Consider an optimal path of the System (L). Suppose every time this optimal path crosses over from $C_1$ to $C_2$ or from $C_2$ to $C_1$ it does so at points belonging to $\mathrm{Int}(C_{12})$ or $\mathrm{Int}(C_{21})$ respectively; then the optimal control as we traverse this optimal path does not suffer more than (n-1) discontinuous switchings.

==> The optimal control, considered as a function of time, governing the optimal trajectory (in M) which arises as the projection of this optimal path does not suffer more than (n-1) discontinuities.


Step 2. 

We now take off the restriction that was imposed on the integral curve of $X'$ through $\lambda_0$, viz., that it never meets $A'_2$. Consider the mth stage of Step 1 again. Suppose the integral curve of $X'$ from the point $\lambda_{m-1}$ meets $A'_1$ again in time $t_m > 0$ at a point $\lambda_m \in A'_2$. We show that the boundary transversality condition is satisfied. Let us go back to the set $A_2$ for a while. We have partitioned this set by expressing it as a disjoint union

$A_2 = \Gamma_1 \cup G_1 \cup \Gamma_2 \cup G_2 \cup \dots \cup \Gamma_{n-2} \cup G_{n-2}$.
Let k be an integer with $1 \le k \le n-2$. In keeping with the convention adopted in Step 3 of Section (3-2) we take the points in $\Gamma_k$ to be of a different nature from the points in

$\mathrm{Int}(C_{12}),\ \mathrm{Int}(C_{21}),\ \Gamma_1, \Gamma_2, \dots, \Gamma_{k-1}, \Gamma_{k+1}, \dots, \Gamma_{n-2},\ G_1, G_2, \dots, G_{n-2}$,

and a similar convention holds for the points in $G_k$. This in turn prompts us to treat each of these sets as a separate entity. To take care of this fact we have partitioned $A'_2$ in a similar manner by expressing it as the disjoint union

$A'_2 = \Gamma'_1 \cup G'_1 \cup \Gamma'_2 \cup G'_2 \cup \dots \cup \Gamma'_{n-2} \cup G'_{n-2}$.

Now suppose $\lambda_m \in \Gamma'_k$, $1 \le k \le n-2$ (if $\lambda_m \in G'_k$ instead, the corresponding analysis is similar to the one that follows). The boundary transversality condition is satisfied if

$\Phi'(t, a'_{m-1}) \pitchfork \Gamma'_k \quad \forall\, t > 0$.

Let $Q_i$ represent the hyperplane

$\langle \lambda, A^{i} b \rangle = 0, \quad i = 0, 1, 2, \dots$

$\Gamma'_k$ is evidently an open subset of

$Q_0 \cap Q_1 \cap \dots \cap Q_k$,

and from the analysis in Step 1 it follows that $\Phi'(t, a'_{m-1})$ is an open subset of

$\Phi'(t_2+t_3+\dots+t_{m-1}+t, A'_1) \cap \Phi'(t_3+t_4+\dots+t_{m-1}+t, A'_1) \cap \dots \cap \Phi'(t, A'_1)$.
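One way to see why hyperplanes of the form $\langle \lambda, A^{i} b \rangle = 0$ enter at the boundary points is sketched below. It assumes, as a reading not confirmed by this passage, that the co-state obeys $\dot{\lambda} = -A^{T}\lambda$ and that switching is governed by the function $\sigma(t) = \langle \lambda(t), b \rangle$; the symbol $\sigma$ is introduced here for illustration only.

```latex
\begin{align*}
\sigma(t) &= \langle \lambda(t),\, b \rangle, \qquad \dot{\lambda} = -A^{T}\lambda, \\
\frac{d^{i}\sigma}{dt^{i}}(t)
  &= \langle (-A^{T})^{i}\lambda(t),\, b \rangle
   = (-1)^{i}\,\langle \lambda(t),\, A^{i} b \rangle, \\
\lambda \in Q_0 \cap Q_1 \cap \dots \cap Q_k
  &\iff \sigma = \dot{\sigma} = \dots = \sigma^{(k)} = 0 \ \text{at } \lambda .
\end{align*}
```

Under this reading a point of $Q_0 \cap Q_1 \cap \dots \cap Q_k$ is one at which the switching function has a zero of order at least k+1. Whether this is exactly how the sets in the partition of $A'_2$ are defined earlier in the text cannot be confirmed from this passage.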

Further, from Lemma 2 it follows that the (m+k) vectors

$e^{A(t_2+t_3+\dots+t_{m-1}+t)} b,\ e^{A(t_3+t_4+\dots+t_{m-1}+t)} b,\ \dots,\ e^{At} b,\ b,\ Ab,\ \dots,\ A^{k} b$

are linearly independent. Hence

$\Phi'(t, a'_{m-1}) \pitchfork \Gamma'_k \quad \forall\, t > 0$.

We write

$a'_m = \Phi'(t_m, a'_{m-1}) \cap \Gamma'_k$;

$a'_m$ is an open subset of

$\Phi'(t_2+t_3+\dots+t_m, A'_1) \cap \Phi'(t_3+t_4+\dots+t_m, A'_1) \cap \dots \cap \Phi'(t_m, A'_1) \cap Q_0 \cap Q_1 \cap \dots \cap Q_k$. (3.8)

Note that the intersection (3.8) contains the single vector 0 if $m \ge n-k$. Since $\lambda_m$ cannot be 0 we conclude that the integral curve of $X'$ from $\lambda_0$ can never meet $A'_1$ at a point belonging to $\Gamma'_k \subset A'_1$ if $m \ge n-k$, or for that matter it can never meet $A'_1$ at points belonging to any of the sets $\Gamma'_{k+1}, \dots, \Gamma'_{n-2}, G'_{k+1}, \dots, G'_{n-2}$ if $m \ge n-k$.
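The count behind this conclusion can be spelled out. This is a sketch that takes for granted the linear independence supplied by Lemma 2 (for at most n vectors); $N$ is a symbol introduced here for the matrix whose rows are the normals of the hyperplanes in (3.8).

```latex
\begin{align*}
\#\{\text{hyperplanes in (3.8)}\}
  &= \underbrace{(m-1)}_{\text{images of } A'_1 \text{ under } \Phi'}
   + \underbrace{(k+1)}_{Q_0,\,\dots,\,Q_k} = m+k, \\
\operatorname{rank} N &= \min(m+k,\ n), \\
\dim \ker N &= n - \min(m+k,\ n) = 0
  \iff m+k \ge n \iff m \ge n-k .
\end{align*}
```

So once the number of independent normals reaches the dimension n, the only common point of the hyperplanes is the origin, which a nonzero co-state cannot occupy.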

The proof of the transversality condition can now be completed by invoking Lemma 2 every time the integral curve of $X'$ from $\lambda_0$ onwards meets $A'_1$. Further, since codim $a'_m = m+k$, an argument similar to the one developed in Step 1 shows that the integral curve of $X'$ from $\lambda_m$ onwards does not meet $A'_1$ more than (n-1)-(m+k) = n-m-(k+1) times, and that the number of times it meets $A'_1$ is further reduced if in the course of its journey it passes through $A'_2$ one or more times again. In other words, starting from $\lambda_0$ this integral curve meets $A'_1$ at most n-(k+1) times.

We have therefore proved that, given any $\lambda_0$, the integral curve of $X'$ from $\lambda_0$ does not meet $A'_1$ more than (n-1) times.

==> The optimal control as we traverse an optimal path of the System (L) does not suffer more than (n-1) discontinuous switchings.

==> The optimal control, considered as a function of time, governing any optimal trajectory of the System (L) does not suffer more than (n-1) discontinuities.

Fel'dbaum's switching theorem is therefore established, and with this we conclude our study of the discontinuities of the optimal control.
APPENDIX A

Let E be an extremal surface. A reviewer has pointed out that the projection $\psi : E \to \psi(E)$ can be many-to-one in spite of the arguments given on pp. 37-38. Let it be so. Let $A \subset \psi(E)$ be such that E represents the optimal control at every point in A. If $A = \emptyset$ then E is of no use to us. Consider the subset $E' \subset E$ defined by

$E' = \{ ([x, \lambda], u) \in E : [x, \lambda] \in A \text{ and } u \text{ gives the value of the optimal control at } [x, \lambda] \}$.

In other words, E' is that portion of E which actually represents the optimal control in A. Now, since corresponding to each point $[x, \lambda] \in A$ there exists a unique value of the optimal control, it follows that the projection $E' \to A$ is one-to-one. Therefore, even though the projection of E into $T^{*}M$ is many-to-one, the projection of the portion which is responsible for representing the optimal control is one-to-one. Our sole interest in an extremal surface lies in the fact that it represents the optimal control somewhere. We can therefore assume that $\psi : E \to \psi(E)$ is one-to-one without loss of generality.